文件最后提交记录最后更新时间
[AMD] Introduce specialized Allocation pass (#7328)11 个月前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[Gluon] Actually enable scf+cf+arith canonicalizers (#7775)9 个月前
[BACKEND] bump to llvm/llvm-project@e12cbd8 (#6880)1 年前
[AMD] Reject async global to local ops for less than 4 bytes (#6734)1 年前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[AMD] Fix alignment calculation for buffer ops offset in AxisAnalysis (#7886)9 个月前
[AMD] Add fast_tanhf to libdevice (#7780)9 个月前
[AMD] Enable lowerLocalLdSt for AMD path (#7355)10 个月前
[LLs] Tree-reduce the xor reduction in LLs codegen (#7816)9 个月前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[AMD] Enable lowerLocalLdSt for AMD path (#7355)10 个月前
[AMD] Improve register usage in Float8 conversions (#7527)10 个月前
[AMD] Optimize transposed GEMM operand cases (#6074)1 年前
[AMD] Support register broadcast in slice/concat ops (#7407)10 个月前
[AMD] Support register broadcast in slice/concat ops (#7407)10 个月前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
Reland "byte permutes in intra-warp layout conversion" (#7933)9 个月前
[AMD] Eanble NAN propagating MIN/MAX for gfx950 (#6387)1 年前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[AMD] Use permlanex16 for shuffleXor on rdna (#7269)11 个月前
[AMD] Enable lowerLocalLdSt for AMD path (#7355)10 个月前
[Backend] Bump to llvm/llvm-project@bc773632355b (#7881)9 个月前