MindSpeed-MM/mindspeed_mm/configs · Ascend/MindSpeed-MM - AtomGit

ascend-robot[Feature] fsdp2 qwen3vl 30B support lora finetune

文件	最后提交记录	最后更新时间
config.py	[Feature] add st files for simplified qwen3vl configuration Co-authored-by: weixin_44031810<gaojie75@huawei.com> # message auto-generated for no-merge-commit merge: !2015 merge master into master [Feature] add st files for simplified qwen3vl configuration Created-by: gaojie_ Commit-by: weixin_44031810 Merged-by: ascend-robot Description: ## Motivation 因为修改了qwen3vl 30B的配置文件，所以需要修改相应qwen3vl 30b的ST文件，改为与当前对应的拉起和配置方式 ## Modification 新增qwen3vl 30b的ST配置文件和拉起shell 脚本 ## Self-test (Optional) 配置简化在之前的评审中已通过，主要是添加对应的ST文件。测试时间为2026.1.5. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!2015	4 个月前
read_yaml_config.py	[Feature] fsdp2 qwen3vl 30B support lora finetune Co-authored-by: LKONE<wanglikai4@huawei.com> # message auto-generated for no-merge-commit merge: !2189 merge master into master [Feature] fsdp2 qwen3vl 30B support lora finetune Created-by: wanglikai1019 Commit-by: LKONE Merged-by: ascend-robot Description: ## Motivation During gradient recomputation in the backward pass, it is necessary to obtain the gradient function outside this block as the entry point for the backward pass. Therefore, a patch is added to enable the gradient function of the embed_tokens layer, allowing LoRA fine-tuning to enter the gradient recomputation backward pass. Related history PR: https://gitee.com/ascend/MindSpeed-LLM/pulls/1498 ## Modification 1.Add a new LoRA fine-tuning script for 30B models 2.Add parsing code for LoRA parameters 3.Add a LoRA patch for the gradient recomputation scenario ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!2189	3 个月前
validate_params.json	【Bugfix】Correct the spelling errors in the code Co-authored-by: lu-jinfu1999<lujinfu1@h-partners.com> # message auto-generated for no-merge-commit merge: !1670 merge master into master 【Bugfix】Correct the spelling errors in the code Created-by: lu-jinfu1999 Commit-by: lu-jinfu1999 Merged-by: ascend-robot Description: ## Motivation Correct the spelling errors in the code. ## Modification Correct spelling errors in the document. Correct spelling errors in the code. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1670	6 个月前