文件最后提交记录最后更新时间
feature(pytorch): FSDP2 support hardware-adaptive execution Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4343 merge fsdp2_gpu into master feature(pytorch): FSDP2 support hardware-adaptive execution Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? FSDP2 support hardware-adaptive execution ## Does this PR introduce any user-facing change? No ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43432 个月前
feat(pytorch):ring cp support MLA/GQA Co-authored-by: mhh111<mahonghao1@huawei.com> # message auto-generated for no-merge-commit merge: !4412 merge ring0418 into master feat(pytorch):ring cp support MLA/GQA Created-by: mhh111 Commit-by: mhh111 Merged-by: ascend-robot Description: ## What this PR does / why we need it? ring cp support MLA/GQA ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!44121 个月前
chore(fsdp2): develop longcat-flash-lite model in fsdp2 Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4344 merge br_master_longcat_flash_lite_fsdp2 into master chore(fsdp2): develop longcat-flash-lite model in fsdp2 Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ## What this PR does / why we need it? develop longcat-flash-lite model in fsdp2 ## Does this PR introduce any user-facing change? new model development, no user-facing change ## How was this patch tested? Run the inference task and check whether the model can perform normal dialogs. See merge request: Ascend/MindSpeed-LLM!434410 小时前
feature(pytorch): FSDP2 support hardware-adaptive execution Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4343 merge fsdp2_gpu into master feature(pytorch): FSDP2 support hardware-adaptive execution Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? FSDP2 support hardware-adaptive execution ## Does this PR introduce any user-facing change? No ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43432 个月前
chore(fsdp2): develop longcat-flash-lite model in fsdp2 Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4344 merge br_master_longcat_flash_lite_fsdp2 into master chore(fsdp2): develop longcat-flash-lite model in fsdp2 Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ## What this PR does / why we need it? develop longcat-flash-lite model in fsdp2 ## Does this PR introduce any user-facing change? new model development, no user-facing change ## How was this patch tested? Run the inference task and check whether the model can perform normal dialogs. See merge request: Ascend/MindSpeed-LLM!434410 小时前
chore(fsdp2): develop longcat-flash-lite model in fsdp2 Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4344 merge br_master_longcat_flash_lite_fsdp2 into master chore(fsdp2): develop longcat-flash-lite model in fsdp2 Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ## What this PR does / why we need it? develop longcat-flash-lite model in fsdp2 ## Does this PR introduce any user-facing change? new model development, no user-facing change ## How was this patch tested? Run the inference task and check whether the model can perform normal dialogs. See merge request: Ascend/MindSpeed-LLM!434410 小时前
chore(fsdp2): develop longcat-flash-lite model in fsdp2 Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4344 merge br_master_longcat_flash_lite_fsdp2 into master chore(fsdp2): develop longcat-flash-lite model in fsdp2 Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ## What this PR does / why we need it? develop longcat-flash-lite model in fsdp2 ## Does this PR introduce any user-facing change? new model development, no user-facing change ## How was this patch tested? Run the inference task and check whether the model can perform normal dialogs. See merge request: Ascend/MindSpeed-LLM!434410 小时前
feat(pytorch): FSDP2 support GDN cp Co-authored-by: mhh111<mahonghao1@huawei.com> # message auto-generated for no-merge-commit merge: !4510 merge 0522_1 into master feat(pytorch): FSDP2 support GDN cp Created-by: mhh111 Commit-by: mhh111 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!45106 天前
chore(fsdp2): develop longcat-flash-lite model in fsdp2 Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4344 merge br_master_longcat_flash_lite_fsdp2 into master chore(fsdp2): develop longcat-flash-lite model in fsdp2 Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ## What this PR does / why we need it? develop longcat-flash-lite model in fsdp2 ## Does this PR introduce any user-facing change? new model development, no user-facing change ## How was this patch tested? Run the inference task and check whether the model can perform normal dialogs. See merge request: Ascend/MindSpeed-LLM!434410 小时前
[pytorch][feature]load model and get args adaptation for fsdp2 backend Co-authored-by: cjy840282<chenjingyi9@huawei.com> # message auto-generated for no-merge-commit merge: !4087 merge master into master [pytorch][feature]load model and get args adaptation for fsdp2 backend Created-by: cjy840282 Commit-by: cjy840282 Merged-by: ascend-robot Description: load model and get args adaptation See merge request: Ascend/MindSpeed-LLM!40874 个月前