MindSpeed-MM/mindspeed_mm · Ascend/MindSpeed-MM - AtomGit

ascend-robot: [Bugfix] Layerzero saving weight bugfix-branch 2.3.0

4da05733, created on December 16, 2025 (commit history)

File	Last commit	Last updated
configs	[mindspore] support qwen3vl finetune for megatron backend. Co-authored-by: ffmh<fengminghao2@huawei.com>. Merge request Ascend/MindSpeed-MM!1785 (qwen3vl_megatron into master), created and committed by ffmh, merged by ascend-robot. Motivation: adapt qwen3vl 8B/30B to the megatron backend. Modification: add layer_spec modifications and deepstack-related adaptation to the text module's transformer block/layer.	6 months ago
data	[Modify] Qwen2.5omni use videoprocessor to adapt transformers 4.53.0. Co-authored-by: AZe_404<wangze62@h-partners.com>. Merge request Ascend/MindSpeed-MM!1891 (video_proc_230 into 2.3.0), created and committed by AZe_404, merged by ascend-robot. Motivation and modification: Qwen2.5omni uses videoprocessor to adapt to transformers 4.53.0.	5 months ago
mindspore	[Modify][MindSpore] Optimize data preprocessing performance for Qwen3vl. Co-authored-by: weixin_47897441<wuyouqi1@h-partners.com>. Merge request Ascend/MindSpeed-MM!1908 (2.3.0-qwenvl3 into 2.3.0), created and committed by weixin_47897441, merged by ascend-robot. Motivation: optimize qwenvl3 data preprocessing by replacing some tensor operations with tensor-to-numpy conversions, which reduces unnecessary tensor memory copies and improves the overall efficiency of the preprocessing pipeline. Modification: replace tensor-based data operations that cause redundant memory copies with tensor-to-numpy conversion operations. Self-test: 1. After modification, the output of qwenvl3 data preprocessing is consistent with the original version. 2. The preprocessing time is reduced from 10xPTA to 5.2xPTA. ![image.png](https://raw.gitcode.com/user-images/assets/7404510/3265ef95-3228-4034-be9c-8d5be0de5e32/image.png 'image.png')	5 months ago
models	[Bugfix]Layerzero saving weight bugfix-branch 2.3.0. Co-authored-by: zs-derrick1<1434012475@qq.com>. Merge request Ascend/MindSpeed-MM!1943 (2.3.0 into 2.3.0), created and committed by zs-derrick1, merged by ascend-robot. Motivation: Layerzero saving weight bugfix-branch 2.3.0.	5 months ago
patchs	[Modify]Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion. Co-authored-by: meng-coding<wumengjie6@huawei.com>. Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com>. Merge request Ascend/MindSpeed-MM!1898 (pick_master_ckpt_to_230 into 2.3.0), created by yaoyaoxu, committed by yaoyaoxu and meng-coding, merged by ascend-robot. Motivation and modification: performance optimization for Qwen3-Omni Thinker MoE expert weight conversion.	5 months ago
tasks	[Feature]DanceGRPO add unit test. Co-authored-by: young256<young256@noreply.gitcode.com>. Merge request Ascend/MindSpeed-MM!1702 (dancegrpo-ut into master), created and committed by young256, merged by ascend-robot. Motivation and modification: add unit tests for DanceGRPO. Self-test: not involved. BC-breaking: not involved.	6 months ago
tools	[Feature]VACE Model and Data Process. Co-authored-by: feng0w0<houyufeng4@huawei.com>. Merge request Ascend/MindSpeed-MM!1669 (master into master), created and committed by feng0w0, merged by ascend-robot. Motivation: support the VACE model. Modification: added VACE data processing pipeline and model components.	6 months ago
utils	[mindspore] support qwen3vl finetune for megatron backend. Co-authored-by: ffmh<fengminghao2@huawei.com>. Merge request Ascend/MindSpeed-MM!1785 (qwen3vl_megatron into master), created and committed by ffmh, merged by ascend-robot. Motivation: adapt qwen3vl 8B/30B to the megatron backend. Modification: add layer_spec modifications and deepstack-related adaptation to the text module's transformer block/layer.	6 months ago
__init__.py	!1322 [Bugfix] lora patch lazy init. Merge pull request !1322 from zs-Derrick/master	10 months ago
arguments.py	[Modify] hetero-pp support encoder/decoder different micro-batch-size & improve qwen2.5omni data balance on global batch. Co-authored-by: huangdabiao<huangdabiao1@huawei.com>. Merge request Ascend/MindSpeed-MM!1736 (test_final_1111 into master), created and committed by huangdabiao, merged by ascend-robot. Motivation and modification: hetero-pp supports different micro-batch sizes for the encoder and decoder, and qwen2.5omni data balance on the global batch is improved.	6 months ago
training.py	[Modify] hetero-pp support encoder/decoder different micro-batch-size & improve qwen2.5omni data balance on global batch. Co-authored-by: huangdabiao<huangdabiao1@huawei.com>. Merge request Ascend/MindSpeed-MM!1736 (test_final_1111 into master), created and committed by huangdabiao, merged by ascend-robot. Motivation and modification: hetero-pp supports different micro-batch sizes for the encoder and decoder, and qwen2.5omni data balance on the global batch is improved.	6 months ago