triton-ascend/lib/Dialect/TritonNvidiaGPU/Transforms · Ascend/triton-ascend - AtomGit

文件	最后提交记录	最后更新时间
CMakeLists.txt	[WS] Update aref ops and lower_aref pass (#7479)	10 个月前
FenceInsertion.cpp	Pawel/hoist unpipelineable operands (#7082)	11 个月前
InterleaveTMem.cpp	[TritonNvidiaGPU] Fix memory leak in TritonNvidiaGPU/Transforms/InterleaveTMem.cpp (#7924)	9 个月前
MMALowering.cpp	[NVIDIA] Add `is_async` flag to MMAv5 ops (#7590)	10 个月前
OptimizeDescriptorEncoding.cpp	[NFC] Use RankedTensorType's clone and cloneWithEncoding member functions (#7464)	10 个月前
OptimizeTMemLayouts.cpp	[BACKEND] Add optimization for local_load -> tmem_store layout (#7015)	9 个月前
PlanCTA.cpp	[NFC] Use RankedTensorType's clone and cloneWithEncoding member functions (#7464)	10 个月前
PromoteLHSToTMem.cpp	[LAYOUTS] Implement toLinearLayout for TensorMemoryEncodingAttr (#7748)	10 个月前
ProxFenceInsertion.cpp	[LAYOUTS] Implement generalized swizzling for convert_layout (#7565)	10 个月前
RemoveTMEMTokens.cpp	[NFC] Remove uses of deprecated `GEN_PASS_CLASSES` for `TritonNvidiaGPU/Transforms` (#6898)	1 年前
TMALowering.cpp	[BACKEND] Use immutable alloc for tma store (#7873)	9 个月前
TMAUtilities.cpp	Add support for padding option to TMA loads (#7993)	9 个月前
TensorMemoryAllocation.cpp	[Blackwell] Handle control flow in TMEM allocation (#7698)	10 个月前
Utility.cpp	[WS] Update aref ops and lower_aref pass (#7479)	10 个月前