| [mindspore][master]adapt swap attention for Qwen2.5vl-72B
Co-authored-by: iquoyuw<wuyouqi1@h-partners.com>
# message auto-generated for no-merge-commit merge:
!2999 merge sa-master into master
[mindspore][master]adapt swap attention for Qwen2.5vl-72B
Created-by: weixin_47897441
Commit-by: iquoyuw
Merged-by: ascend-robot
Description: Adapt swap attention for Qwen2.5vl-72B
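**Description of changes:**
Adapt the swap attention feature to reduce device memory usage for Qwen2.5vl-72B.
**Self-verification results:**
1. Feature behavior is aligned with ptA; the memory reduction is identical for the same operators:

2. With the feature both disabled and enabled, precision aligns with ptA with zero error: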

See merge request: Ascend/MindSpeed!2999 | 6 months ago |