MindSpeed-LLM/mindspeed_llm/features_manager/transformer · Ascend/MindSpeed-LLM - AtomGit

ascend-robotfeat(pytorch): support o lora rank and q lora rank in v4pro

文件	最后提交记录	最后更新时间
flash_attention	[pytorch][feature] kvallgather supports TND Co-authored-by: Jia_Austin<dengjia6@huawei.com> # message auto-generated for no-merge-commit merge: !4277 merge fix_te_tnd into master [pytorch][feature] kvallgather supports TND Created-by: Jia_Austin Commit-by: Jia_Austin Merged-by: ascend-robot Description: ## What this PR does / why we need it? feat: TE tnd ## Does this PR introduce any user-facing change? NA ## How was this patch tested? Turn on and off TE CP TND See merge request: Ascend/MindSpeed-LLM!4277	2 个月前
multi_latent_attention	feat(pytorch): support o lora rank and q lora rank in v4pro Co-authored-by: dingzicha1997<dingzilin@huawei.com> # message auto-generated for no-merge-commit merge: !4427 merge master into master feat(pytorch): support o lora rank and q lora rank in v4pro Created-by: dingzicha1997 Commit-by: dingzicha1997 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4427	1 个月前
qwen3_next_attention	fix: Disallow use-global-aux-loss when moe-alltoall-overlap-comm is enabled Co-authored-by: zzyyjj012<yangzj012@qq.com> # message auto-generated for no-merge-commit merge: !4383 merge master into master fix: Disallow use-global-aux-loss when moe-alltoall-overlap-comm is enabled Created-by: zzyyjj012 Commit-by: zzyyjj012 Merged-by: ascend-robot Description: ## What this PR does / why we need it? 当use-global-aux-loss和moe-alltoall-overlap-comm同时开启时，显式抛出异常。 ## Does this PR introduce any user-facing change? 无 ## How was this patch tested? 关闭moe-alltoall-overlap-comm时，use-global-aux-loss可正常开启使用。打开moe-alltoall-overlap-comm时，抛出异常。 See merge request: Ascend/MindSpeed-LLM!4383	1 个月前
__init__.py	!2923 [pytorch][refactor]mtp update Merge pull request !2923 from shengjy/mtp0626	10 个月前
mhc_feature.py	refactor(pytorch): update deepseek4 shell Co-authored-by: dingzicha1997<dingzilin@huawei.com> # message auto-generated for no-merge-commit merge: !4423 merge master into master refactor(pytorch): update deepseek4 shell Created-by: dingzicha1997 Commit-by: dingzicha1997 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4423	1 个月前
mtp.py	refactor(pytorch): update deepseek4 shell Co-authored-by: dingzicha1997<dingzilin@huawei.com> # message auto-generated for no-merge-commit merge: !4423 merge master into master refactor(pytorch): update deepseek4 shell Created-by: dingzicha1997 Commit-by: dingzicha1997 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4423	1 个月前
transformer_block.py	feat(pytorch): support deepseekv4_flash in mcore backend Co-authored-by: dingzicha1997<dingzilin@huawei.com> # message auto-generated for no-merge-commit merge: !4420 merge geneva2 into master feat(pytorch): support deepseekv4_flash in mcore backend Created-by: dingzicha1997 Commit-by: dingzicha1997 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4420	1 个月前