| update: rename blocksparseattention -> adablocksparseattention | 1 个月前 |
| update: rename blocksparseattention -> adablocksparseattention | 1 个月前 |
| [feature]新增adaLayernormv2的plugin | 4 个月前 |
| [feature]新增adaLayernormv2的plugin | 4 个月前 |
| [feature] blockSparseAttention plugin & rf_v3 | 1 个月前 |
| [feature] blockSparseAttention plugin & rf_v3 | 1 个月前 |
| 【docs】文档修改-增加API参考&加速API | 1 个月前 |
| aclnn算子op plugin 公共接口 | 6 个月前 |
| [Feature][ops]aclnn编译工程适配 | 3 个月前 |
| [dev]同步最新代码 | 6 个月前 |
| [Feature][ops]aclnn编译工程适配 | 3 个月前 |
| [dev]同步最新代码 | 6 个月前 |
| [Bugfix][ops]Add dimension validation to prevent size_t underflow in layernorm | 22 天前 |
| [Bugfix][ops]Add dimension validation to prevent size_t underflow in layernorm | 22 天前 |
| feat: quant_flash_attn and quant_flash_attn_metadata operators | 14 天前 |
| feat: quant_flash_attn and quant_flash_attn_metadata operators | 14 天前 |
| feat: quant_flash_attn and quant_flash_attn_metadata operators | 14 天前 |
| feat: quant_flash_attn and quant_flash_attn_metadata operators | 14 天前 |
| feat: quant_flash_attn and quant_flash_attn_metadata operators | 14 天前 |
| 【docs】文档修改-增加API参考&加速API | 1 个月前 |
| [feature]新增RainFusionAttention算子的plugin | 5 个月前 |
| feat: quant_flash_attn and quant_flash_attn_metadata operators | 14 天前 |
| [dev]同步最新代码 | 6 个月前 |
| [dev]同步最新代码 | 6 个月前 |
| [Feature][ops]aclnn编译工程适配 | 3 个月前 |
| [feature]新增SparseBlockEstimate算子以及plugin和UT | 5 个月前 |