| fix(incre_flash_attention): 显式指定DataCopy模板参数避免KVINT4编译错误
Co-authored-by: leiqingji<leiqingji@h-partners.com>
# message auto-generated for no-merge-commit merge:
!5831 merge fix/incre-flash-attention-datacopy into master
fix(incre_flash_attention): 显式指定DataCopy模板参数避免KVINT4编译错误
Created-by: leiqingji
Commit-by: leiqingji
Merged-by: cann-robot
Description: ## 描述
在KVINT4分支的AntiquantAIterExpand函数中,DataCopy调用时显式指定<int8>模板参数,解决编译时模板参数推导失败的问题。
## 改动
- 修复 incre_flash_attention_preload.h 中 DataCopy 模板参数
- 修复 incre_flash_attention_preload_dd.h 中 DataCopy 模板参数
- 修复 incre_flash_attention_split_Bbn2s2_Us2.h 中 DataCopy 模板参数
## 关联的Issue
https://gitcode.com/cann/ops-transformer/issues/2179
## 测试
- 编译测试通过
## 类型标签
- [x] 🐛 Bug 修复
See merge request: cann/ops-transformer!5831 | 6 天前 |