| [AMD] NFC: Drop version minor for AMD MFMA layout (#7285) | 11 months ago |
| [AMD][NFC] use double quote instead of single quote (#6311) | 1 year ago |
| [AMD] Improve scaled dot with (b)f16 types on GFX950 (#7693) | 10 months ago |
| [AMD] Support 4x64 and 64x4 MFMA layout for dot (#7576) | 10 months ago |
| [AMD] Disable dot wmma configurations that use f16 as accumulator (#6460) | 1 year ago |
| [AMD][gfx12] Enable dot for f8 operands (#6814) | 1 year ago |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 months ago |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 months ago |
| [AMD] Added a canonicalizer to ConcatOp (#7273) | 11 months ago |
| [AMD] Fix multi-yield scf.if during pointer canonicalization (#6276) | 1 year ago |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 months ago |
| [AMD] Reject async global to local ops for less than 4 bytes (#6734) | 1 year ago |
| [AMD] Support register broadcast in slice/concat ops (#7407) | 10 months ago |
| [AMD][NFC] use double quote instead of single quote (#6311) | 1 year ago |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 months ago |
| [AMD] Fix buffer op mask operand removal (#7963) | 9 months ago |
| [AMD] Support register broadcast in slice/concat ops (#7407) | 10 months ago |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 months ago |
| [AMD] NFC: Drop version minor for AMD MFMA layout (#7285) | 11 months ago |
| [AMD] Use single LDS for both transposed and non-transposed access (#7813) | 9 months ago |
| [Dialect] Actually enable TMEM layout check and fix all the tests (#7723) | 10 months ago |
| [AMD] Add ChainedDotSchedule to StreamPipeliner (#7601) | 10 months ago |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 months ago |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 months ago |
| [AMD] NFC: Drop version minor for AMD MFMA layout (#7285) | 11 months ago |
| [LAYOUTS] Enable generic swizzling on AMD (#7225) | 11 months ago |
| [LAYOUTS] Enable generic swizzling on AMD (#7225) | 11 months ago |
| [Gluon] Actually enable scf+cf+arith canonicalizers (#7775) | 9 months ago |
| [IR] Improve memdesc_index printing to make it obvious what operand is the index (#7876) | 9 months ago |
| [LAYOUTS] Implement toLinearLayout for TensorMemoryEncodingAttr (#7748) | 10 months ago |
| [AMD] Support async load in ping-pong pass (#7458) | 10 months ago |
| [AMD] NFC: Drop version minor for AMD MFMA layout (#7285) | 11 months ago |
| [AMD] Prefer swizzle layout in optimize-lds-usage pass (#7750) | 9 months ago |
| [AMD] NFC: Drop version minor for AMD MFMA layout (#7285) | 11 months ago |