| [AMD] Introduce specialized Allocation pass (#7328) | 11 个月前 |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 个月前 |
| [Gluon] Actually enable scf+cf+arith canonicalizers (#7775) | 9 个月前 |
| [BACKEND] bump to llvm/llvm-project@e12cbd8 (#6880) | 1 年前 |
| [AMD] Reject async global to local ops for less than 4 bytes (#6734) | 1 年前 |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 个月前 |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 个月前 |
| [AMD] Fix alignment calculation for buffer ops offset in AxisAnalysis (#7886) | 9 个月前 |
| [AMD] Add fast_tanhf to libdevice (#7780) | 9 个月前 |
| [AMD] Enable lowerLocalLdSt for AMD path (#7355) | 10 个月前 |
| [LLs] Tree-reduce the xor reduction in LLs codegen (#7816) | 9 个月前 |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 个月前 |
| [AMD] Enable lowerLocalLdSt for AMD path (#7355) | 10 个月前 |
| [AMD] Improve register usage in Float8 conversions (#7527) | 10 个月前 |
| [AMD] Optimize transposed GEMM operand cases (#6074) | 1 年前 |
| [AMD] Support register broadcast in slice/concat ops (#7407) | 10 个月前 |
| [AMD] Support register broadcast in slice/concat ops (#7407) | 10 个月前 |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 个月前 |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 个月前 |
| Reland "byte permutes in intra-warp layout conversion" (#7933) | 9 个月前 |
| [AMD] Eanble NAN propagating MIN/MAX for gfx950 (#6387) | 1 年前 |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 个月前 |
| [AMD] Use permlanex16 for shuffleXor on rdna (#7269) | 11 个月前 |
| [AMD] Enable lowerLocalLdSt for AMD path (#7355) | 10 个月前 |
| [Backend] Bump to llvm/llvm-project@bc773632355b (#7881) | 9 个月前 |