File | Last commit | Last updated
!3282 [mindspore][master] fix ringattentionupdate. Merge pull request !3282 from 杨承翰/ring_master. 8 months ago
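!2744 [Training performance] Optimize the performance of recursive parameter retrieval via named_modules in the compatibility layer. Merge pull request !2744 from zhangyihui/master. 11 months ago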
!3352 [mindspore][master][bugfix] patch_apply_llama3_scaling. Merge pull request !3352 from xinyuan/patch_apply_llama3_scaling. 8 months ago
fix(pytorch): fix duplicate transports in mla and rope. Co-authored-by: zhyebin01 <zhangyebin@h-partners.com>. Auto-generated message for no-merge-commit merge: !4309, merge bugfix2 into master. Created-by: zhyebin01. Commit-by: zhyebin01. Merged-by: ascend-robot. Description: What this PR does / why we need it: fix duplicate transports in mla and rope. Does this PR introduce any user-facing change? No. How was this patch tested? Pipeline test passed. See merge request: Ascend/MindSpeed-LLM!4309. 2 months ago
[mindspore][model] optimize dsv3 performance. Co-authored-by: wanglijun55 <wanglijun54@huawei.com>. Auto-generated message for no-merge-commit merge: !3871, merge master-ds3 into master. Created-by: wanglijun55. Commit-by: wanglijun55. Merged-by: ascend-robot. Description: Related issue: https://e.gitee.com/mind_spore/issues/table?issue=ID75AB. Change description: 1. Optimization 1: the for loop in get_batch repeatedly calls item, gt, bool, and similar operations on CPU tensors; the patch converts the tensors to numpy first and then runs the for loop. Self-verification results: 1. Performance reaches 0.99x pta, with precision aligned. ![image.png](https://raw.gitcode.com/user-images/assets/7623105/d8798baa-d45a-4511-bfce-4177cc348645/image.png 'image.png') ![Snipaste_2025-12-04_19-30-34.jpg](https://raw.gitcode.com/user-images/assets/7623105/e19f80d2-c4c9-40d8-a123-8210d7481c30/Snipaste_2025-12-04_19-30-34.jpg 'Snipaste_2025-12-04_19-30-34.jpg') See merge request: Ascend/MindSpeed-LLM!3871. 5 months ago
[mindspore][bugfix][master] fix shared expert by removing stream. Co-authored-by: ffmh <fengminghao2@huawei.com>. Auto-generated message for no-merge-commit merge: !3917, merge fix_shared_master into master. Created-by: ffmh. Commit-by: ffmh. Merged-by: ascend-robot. Description: fix shared expert by removing stream. See merge request: Ascend/MindSpeed-LLM!3917. 5 months ago