MindSpeed-LLM/mindspeed_llm/fsdp2/distributed/expert_parallel · Ascend/MindSpeed-LLM - AtomGit

ascend-robot chore(fsdp2): develop longcat-flash-lite model in fsdp2

File	Last commit	Last updated
__init__.py	[pytorch][feature] GPT-OSS supports EP training. Co-authored-by: sunjunjie1587 <sunjunjie8@huawei.com>. Merge request Ascend/MindSpeed-LLM!4184 (master into master); created and committed by sunjunjie1587, merged by ascend-robot.	3 months ago
dispatcher.py	feature(pytorch): FSDP2 support hardware-adaptive execution. Co-authored-by: zhyebin01 <zhangyebin@h-partners.com>. Merge request Ascend/MindSpeed-LLM!4343 (fsdp2_gpu into master); created and committed by zhyebin01, merged by ascend-robot. User-facing change: none. Testing: pipeline test passed.	2 months ago
dispatcher_mc2.py	[pytorch][ci] Add pretrain and finetune ST for FSDP2 backend. Co-authored-by: sunjunjie1587 <sunjunjie8@huawei.com>. Merge request Ascend/MindSpeed-LLM!4204 (master into master); created and committed by sunjunjie1587, merged by ascend-robot.	3 months ago
expert_fully_shard_parallel.py	[feat] Add FSDP for MXFP8. Co-authored-by: EVA1 <jingsiyu1@huawei.com>, quancs001 <quancs@qq.com>, h00638954 <huangzhiyuan8@huawei.com>. Merge request Ascend/MindSpeed-LLM!4379 (fsdp_comm into master); created by quancs001, committed by EVA1, quancs001, and h00638954, merged by ascend-robot. Description: adds MXFP8 FSDP functionality, supporting Dense/MoE (EP+EFSDP).	1 month ago
expert_parallel.py	chore(fsdp2): develop longcat-flash-lite model in fsdp2. Co-authored-by: guihaowen666 <guihaowen@huawei.com>. Merge request Ascend/MindSpeed-LLM!4344 (br_master_longcat_flash_lite_fsdp2 into master); created and committed by guihaowen666, merged by ascend-robot. User-facing change: none (new model development). Testing: ran the inference task and checked that the model can hold a normal dialog.	15 hours ago
utils.py	feature(pytorch): FSDP2 support hardware-adaptive execution. Co-authored-by: zhyebin01 <zhangyebin@h-partners.com>. Merge request Ascend/MindSpeed-LLM!4343 (fsdp2_gpu into master); created and committed by zhyebin01, merged by ascend-robot. User-facing change: none. Testing: pipeline test passed.	2 months ago