| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
[Feature] Support custom optimizer lr scale and weight decay ignore Co-authored-by: suis-moi<zhushixiang@huawei.com> # message auto-generated for no-merge-commit merge: !1878 merge master into master [Feature] Support custom optimizer lr scale and weight decay ignore Created-by: suis-moi Commit-by: suis-moi Merged-by: ascend-robot Description: ## Motivation support custom optimizer lr scale and weight decay ignore ## Modification 1、Add condition function to determine if a parameter should be excluded from weight decay. 2、Add condition function to determine if a parameter should apply learning rate scaling (with --lr-mult). 使用示例: 在训练shell脚本的args添加 shell GPT_ARGS=" ... --weight-decay-exclude-modules norm bias \ # 排除指定参数的权重衰减,根据实际需要设置参数关键词 --lr-scale-modules vision \ # 对视觉模块参数缩放学习率(如mult=0.5) --lr-mult 0.5 \ " ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1878 | 6 个月前 | |
[Docs] Update mindspeed commit_id in user-guide Co-authored-by: AZe_404<wangze62@h-partners.com> # message auto-generated for no-merge-commit merge: !1893 merge user_guide into master [Docs] Update mindspeed commit_id in user-guide Created-by: AZe_404 Commit-by: AZe_404 Merged-by: ascend-robot Description: ## Motivation Update mindspeed commit_id in user-guide. ## Modification Update mindspeed commit_id in user-guide. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1893 | 6 个月前 | |
[Docs] Update the version of CANN in installation part Co-authored-by: AZe_404<wangze62@h-partners.com> # message auto-generated for no-merge-commit merge: !1953 merge update_installation into master [Docs] Update the version of CANN in installation part Created-by: AZe_404 Commit-by: AZe_404 Merged-by: ascend-robot Description: ## Motivation Update the version of CANN in installation part. ## Modification Update the version of CANN in installation part. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1953 | 6 个月前 | |
[Docs]Correct the wrong words. Co-authored-by: zs-derrick1<1434012475@qq.com> # message auto-generated for no-merge-commit merge: !1881 merge docs into master [Docs]Correct the wrong words. Created-by: zs-derrick1 Commit-by: zs-derrick1 Merged-by: ascend-robot Description: ## Motivation Correct the wrong words. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1881 | 6 个月前 | |
| 11 个月前 | ||
| 11 个月前 | ||
add docs for MindSpeed-MM introduction, hetero-parallel, Qwen3VL freture. Co-authored-by: cxiaolong<2845907121@qq.com> # message auto-generated for no-merge-commit merge: !1743 merge master into master [Docs] add Mindspeed-MM、hetero parallel docs, Qwen3VL feature docs Created-by: cxiaolong Commit-by: cxiaolong Merged-by: ascend-robot Description: ## Motivation add hetero parallel docs. ## Modification 1. hetero parallel docs. 2. MindSpeed-MM introducetion. 3. add Qwen3VL freture configuration. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1743 | 6 个月前 | |
[Docs]Add model-migration-guide doc Co-authored-by: zs-derrick1<1434012475@qq.com> # message auto-generated for no-merge-commit merge: !1852 merge docs into master [Docs]Add model-migration-guide doc Created-by: zs-derrick1 Commit-by: zs-derrick1 Merged-by: ascend-robot Description: ## Motivation Add model-migration-guide doc ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1852 | 6 个月前 | |
[Docs] Update mindspeed commit_id in user-guide Co-authored-by: AZe_404<wangze62@h-partners.com> # message auto-generated for no-merge-commit merge: !1893 merge user_guide into master [Docs] Update mindspeed commit_id in user-guide Created-by: AZe_404 Commit-by: AZe_404 Merged-by: ascend-robot Description: ## Motivation Update mindspeed commit_id in user-guide. ## Modification Update mindspeed commit_id in user-guide. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1893 | 6 个月前 |
| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
| 6 个月前 | ||
| 6 个月前 | ||
| 6 个月前 | ||
| 6 个月前 | ||
| 11 个月前 | ||
| 11 个月前 | ||
| 6 个月前 | ||
| 6 个月前 | ||
| 6 个月前 |