File | Last commit | Last updated
[mindspore] support qwen3vl finetune for megatron backend. Co-authored-by: ffmh <fengminghao2@huawei.com>. Merge !1785: merge qwen3vl_megatron into master. Created-by: ffmh; Commit-by: ffmh; Merged-by: ascend-robot. Motivation: adapt qwen3vl 8B/30B to the megatron backend. Modification: add layer_spec modifications and deepstack-related adaptations to the transformer block/layer of the text module. See merge request: Ascend/MindSpeed-MM!1785. 6 months ago
[Modify] Qwen2.5omni: use videoprocessor to adapt to transformers 4.53.0. Co-authored-by: AZe_404 <wangze62@h-partners.com>. Merge !1891: merge video_proc_230 into 2.3.0. Created-by: AZe_404; Commit-by: AZe_404; Merged-by: ascend-robot. Motivation and modification: Qwen2.5omni uses videoprocessor to adapt to transformers 4.53.0. See merge request: Ascend/MindSpeed-MM!1891. 5 months ago
[Modify][MindSpore] Optimize data preprocessing performance for Qwen3vl. Co-authored-by: weixin_47897441 <wuyouqi1@h-partners.com>. Merge !1908: merge 2.3.0-qwenvl3 into 2.3.0. Created-by: weixin_47897441; Commit-by: weixin_47897441; Merged-by: ascend-robot. Motivation: optimize qwenvl3 data preprocessing by replacing some tensor operations with tensor-to-numpy conversions, which reduces unnecessary tensor memory copies and improves the overall efficiency of the preprocessing pipeline. Modification: replace tensor-based data operations that cause redundant memory copies with tensor-to-numpy conversion operations. Self-test: (1) after the modification, the output of qwenvl3 data preprocessing is consistent with the original version; (2) the preprocessing time is reduced from 10x PTA to 5.2x PTA. ![image.png](https://raw.gitcode.com/user-images/assets/7404510/3265ef95-3228-4034-be9c-8d5be0de5e32/image.png 'image.png') See merge request: Ascend/MindSpeed-MM!1908. 5 months ago
[Bugfix] Layerzero saving weight bugfix, branch 2.3.0. Co-authored-by: zs-derrick1 <1434012475@qq.com>. Merge !1943: merge 2.3.0 into 2.3.0. Created-by: zs-derrick1; Commit-by: zs-derrick1; Merged-by: ascend-robot. Motivation: fix the Layerzero weight-saving bug on branch 2.3.0. See merge request: Ascend/MindSpeed-MM!1943. 5 months ago
[Modify] Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion. Co-authored-by: meng-coding <wumengjie6@huawei.com>; Co-authored-by: yaoyaoxu <xuyaoyao.824404@huawei.com>. Merge !1898: merge pick_master_ckpt_to_230 into 2.3.0. Created-by: yaoyaoxu; Commit-by: yaoyaoxu, meng-coding; Merged-by: ascend-robot. Motivation and modification: performance optimization for Qwen3-Omni Thinker MoE expert weight conversion. See merge request: Ascend/MindSpeed-MM!1898. 5 months ago
[Feature] DanceGRPO: add unit test. Co-authored-by: young256 <young256@noreply.gitcode.com>. Merge !1702: merge dancegrpo-ut into master. Created-by: young256; Commit-by: young256; Merged-by: ascend-robot. Motivation and modification: add unit tests for DanceGRPO. Self-test: not involved. BC-breaking: not involved. See merge request: Ascend/MindSpeed-MM!1702. 6 months ago
[Feature] VACE Model and Data Process. Co-authored-by: feng0w0 <houyufeng4@huawei.com>. Merge !1669: merge master into master. Created-by: feng0w0; Commit-by: feng0w0; Merged-by: ascend-robot. Motivation: support the VACE model. Modification: added the VACE data processing pipeline and model components. See merge request: Ascend/MindSpeed-MM!1669. 6 months ago
!1322 [Bugfix] lora patch lazy init. Merge pull request !1322 from zs-Derrick/master. 10 months ago
[Modify] hetero-pp: support different micro-batch sizes for encoder and decoder, and improve qwen2.5omni data balance across the global batch. Co-authored-by: huangdabiao <huangdabiao1@huawei.com>. Merge !1736: merge test_final_1111 into master. Created-by: huangdabiao; Commit-by: huangdabiao; Merged-by: ascend-robot. Motivation and modification: hetero-pp supports different micro-batch-size values for the encoder and decoder, and qwen2.5omni data balance on the global batch is improved. See merge request: Ascend/MindSpeed-MM!1736. 6 months ago