| [BACKEND] Refactor shared memory representation in TTGIR (#3261)
Existing shared memory representation in TTGIR had some semantic
problems as it was mixing value semantic with memory semantic.
In order to solve that this PR moves shared memory representation to
memory semantic only.
This means shared memory is now not represented as tensors but as
allocations that may or may not be mutable.
tensor cannot have shared encoding now.
Convert_layout #distributed -> #shared becomes triton_gpu.local_alloc %init
tensor<#distributed> -> !tt.memdesc<#shared>
Convert_layout #shared -> #distributed becomes triton_gpu.local_load
%mem !tt.memdesc<#shared> -> tensor<#distributed>
Insert_slice_async becomes async_copy_global_to_local | 2 年前 |