文件最后提交记录最后更新时间
[mindspore][master]adapt swap attention for Qwen2.5vl-72B Co-authored-by: iquoyuw<wuyouqi1@h-partners.com> # message auto-generated for no-merge-commit merge: !2999 merge sa-master into master [mindspore][master]adapt swap attention for Qwen2.5vl-72B Created-by: weixin_47897441 Commit-by: iquoyuw Merged-by: ascend-robot Description: Adapt swap attention for Qwen2.5vl-72B **修改说明:** 适配swap attention特性,优化qwen2.5vl-72B显存。 **自验证结果:** 1、特性功能与ptA对齐,相同算子下显存缩减量一致: ![image.png](https://raw.gitcode.com/user-images/assets/7404741/a62b945b-eddb-4936-82dd-2af01184caaa/image.png 'image.png') 2、开启特性前后与ptA精度零误差对齐: ![image.png](https://raw.gitcode.com/user-images/assets/7404741/c31f9943-3a7d-465d-8fb2-be7eb83cca02/image.png 'image.png') See merge request: Ascend/MindSpeed!29996 个月前