文件最后提交记录最后更新时间
[pytorch][feature]GPT-OSS supports EP training Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> # message auto-generated for no-merge-commit merge: !4184 merge master into master [pytorch][feature]GPT-OSS supports EP training Created-by: sunjunjie1587 Commit-by: sunjunjie1587 Merged-by: ascend-robot Description: GPT-OSS supports EP training See merge request: Ascend/MindSpeed-LLM!41843 个月前
feature(pytorch): FSDP2 support hardware-adaptive execution Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4343 merge fsdp2_gpu into master feature(pytorch): FSDP2 support hardware-adaptive execution Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? FSDP2 support hardware-adaptive execution ## Does this PR introduce any user-facing change? No ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43432 个月前
[pytorch][ci]Add pretrain and finetune ST for FSDP2 backend Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> # message auto-generated for no-merge-commit merge: !4204 merge master into master [pytorch][ci]Add pretrain and finetune ST for FSDP2 backend Created-by: sunjunjie1587 Commit-by: sunjunjie1587 Merged-by: ascend-robot Description: Add pretrain and finetune ST for FSDP2 backend See merge request: Ascend/MindSpeed-LLM!42043 个月前
[feat] Add FSDP for MXFP8 Co-authored-by: EVA1<jingsiyu1@huawei.com> Co-authored-by: quancs001<quancs@qq.com> Co-authored-by: h00638954<huangzhiyuan8@huawei.com> # message auto-generated for no-merge-commit merge: !4379 merge fsdp_comm into master [feat] Add FSDP for MXFP8 Created-by: quancs001 Commit-by: EVA1;quancs001;h00638954 Merged-by: ascend-robot Description: 添加MXFP8 FSDP功能,支持Dense/MoE(EP+EFSDP) ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!43791 个月前
chore(fsdp2): develop longcat-flash-lite model in fsdp2 Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4344 merge br_master_longcat_flash_lite_fsdp2 into master chore(fsdp2): develop longcat-flash-lite model in fsdp2 Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ## What this PR does / why we need it? develop longcat-flash-lite model in fsdp2 ## Does this PR introduce any user-facing change? new model development, no user-facing change ## How was this patch tested? Run the inference task and check whether the model can perform normal dialogs. See merge request: Ascend/MindSpeed-LLM!434415 小时前
feature(pytorch): FSDP2 support hardware-adaptive execution Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4343 merge fsdp2_gpu into master feature(pytorch): FSDP2 support hardware-adaptive execution Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? FSDP2 support hardware-adaptive execution ## Does this PR introduce any user-facing change? No ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43432 个月前