文件最后提交记录最后更新时间
[AMD] NFC: Drop version minor for AMD MFMA layout (#7285)11 个月前
[AMD][NFC] use double quote instead of single quote (#6311)1 年前
[AMD] Improve scaled dot with (b)f16 types on GFX950 (#7693)10 个月前
[AMD] Support 4x64 and 64x4 MFMA layout for dot (#7576)10 个月前
[AMD] Disable dot wmma configurations that use f16 as accumulator (#6460)1 年前
[AMD][gfx12] Enable dot for f8 operands (#6814)1 年前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[AMD] Added a canonicalizer to ConcatOp (#7273)11 个月前
[AMD] Fix multi-yield scf.if during pointer canonicalization (#6276)1 年前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[AMD] Reject async global to local ops for less than 4 bytes (#6734)1 年前
[AMD] Support register broadcast in slice/concat ops (#7407)10 个月前
[AMD][NFC] use double quote instead of single quote (#6311)1 年前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[AMD] Fix buffer op mask operand removal (#7963)9 个月前
[AMD] Support register broadcast in slice/concat ops (#7407)10 个月前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[AMD] NFC: Drop version minor for AMD MFMA layout (#7285)11 个月前
[AMD] Use single LDS for both transposed and non-transposed access (#7813)9 个月前
[Dialect] Actually enable TMEM layout check and fix all the tests (#7723)10 个月前
[AMD] Add ChainedDotSchedule to StreamPipeliner (#7601)10 个月前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[AMD] NFC: Drop version minor for AMD MFMA layout (#7285)11 个月前
[LAYOUTS] Enable generic swizzling on AMD (#7225)11 个月前
[LAYOUTS] Enable generic swizzling on AMD (#7225)11 个月前
[Gluon] Actually enable scf+cf+arith canonicalizers (#7775)9 个月前
[IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876)9 个月前
[LAYOUTS] Implement toLinearLayout for TensorMemoryEncodingAttr (#7748)10 个月前
[AMD] Support async load in ping-pong pass (#7458)10 个月前
[AMD] NFC: Drop version minor for AMD MFMA layout (#7285)11 个月前
[AMD] Prefer swizzle layout in optimize-lds-usage pass (#7750)9 个月前
[AMD] NFC: Drop version minor for AMD MFMA layout (#7285)11 个月前