文件最后提交记录最后更新时间
[BACKEND] Allow backend to specify special rules for membar insertion (#4675) With block level kind of operations like TMA it is possible that some ops access the shared memory but don't require barriers. This adds a lambda that backends can pass to explicitly skip barriers in between some ops.1 年前
[release/3.2.x] [CHERRY PICK] Add gfx950 target definition (#5452) This PR brings in required LLVM bumps and additional targets for gfx950 support. - https://github.com/triton-lang/triton/pull/5040 - https://github.com/triton-lang/triton/pull/5064 - https://github.com/triton-lang/triton/pull/5180 - https://github.com/triton-lang/triton/pull/5242 - https://github.com/triton-lang/triton/pull/5392 Reverts: - #5347 - #51911 年前
Allow kernel instrumentation passes to be added to pipeline (#3953) This PR adds the ability to insert passes into the Triton LLVM pass pipeline through the LLVM Pass Plugin capability. This is technically an architecture agnostic patch but as of current AMDGPU is the only backend that would make use of it.1 年前