文件最后提交记录最后更新时间
[AMD] Add missing CMake dependency on TritonAMDGPUTableGen (#7824) The library TritonAMDAnalysis includes Dialect.h which in turn includes Dialect.h.inc. This means that for the library to build successfully, the tablegen target that produces Dialect.h.inc must run first. That target is TritonAMDGPUTableGen. However, TritonAMDAnalysis has no dependency on TritonAMDGPUTableGen resulting in spurious build breaks. This change adds the missing dependency. Fixes https://github.com/triton-lang/triton/issues/78219 个月前
[BACKEND] Localize the use and definition of getShapePerCTATile in the AMD backend and aim for elimination (#7740) 9 个月前
[BACKEND] Localize the use and definition of getShapePerCTATile in the AMD backend and aim for elimination (#7740) 9 个月前
Reland "byte permutes in intra-warp layout conversion" (#7933) Reland https://github.com/triton-lang/triton/pull/7809, https://github.com/triton-lang/triton/pull/7825, https://github.com/triton-lang/triton/pull/7861 Add a workaround for ptxas bug and add a regression test9 个月前
[AMD] Enable dot_scaled on gfx11 (#7954) I saw some occasional test failures, but after disabling True16, tests seem to pass fine. --------- Co-authored-by: Paul Trojahn <paul.trojahn@amd.com>9 个月前
[AMD][NFC] Extract range analysis into its own class (#5977) This PR factors TritonIntegerRangeAnalysis out of ConvertBufferOps into a standalone analysis that can be reused in other passes.1 年前