| [pytorch][bugfix]fix hunyuan tune bug
Co-authored-by: guozhihua <guozhihua2@huawei.com>
# message auto-generated for no-merge-commit merge:
!3519 merge huanyuan_tune_fix_2.2 into 2.2.0
[pytorch][bugfix]fix hunyuan tune bug
Created-by: guozhihua2
Commit-by: guozhihua
Merged-by: ascend-robot
Description: 1. Change pad-to-multiple-of to 8 and add use-flash-attn for hunyuan tuning
2. Add recompute-activation-function to qwen25-72b-4k-pack
See merge request: Ascend/MindSpeed-LLM!3519 | 7 months ago |