文件最后提交记录最后更新时间
[feature] qwen3vl support dynamic batching + vision encoder data balance Co-authored-by: huangdabiao<huangdabiao1@huawei.com> # message auto-generated for no-merge-commit merge: !2004 merge qwen3vl_dynamic_batching into master [feature] qwen3vl support dynamic batching + vision encoder data balance Created-by: huangdabiao Commit-by: huangdabiao Merged-by: ascend-robot Description: ## Motivation qwen3vl support dynamic batching + vision encoder data balance ## Modification qwen3vl support dynamic batching + vision encoder data balance ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!20044 个月前
[Docs] Annotation Standardization Rectification Co-authored-by: LKONE<wanglikai4@huawei.com> # message auto-generated for no-merge-commit merge: !2010 merge master into master [Docs] Annotation Standardization Rectification Created-by: wanglikai1019 Commit-by: LKONE Merged-by: ascend-robot Description: ## Motivation Rectify the annotations for the code repository in accordance with the annotation specification requirements. ## Modification 1. Replace Chinese annotations with English ones; 2. Organize the environment variable documentation and add explanations in the startup scripts (currently implemented on Qwen3VL and Wan2.2); 3. Add annotations to the public dataset functions and public model classes of Wan2.2 and Qwen3VL; 4. Adjust the annotation format (e.g., number of indentations). ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!20104 个月前
feat(torch):Support text-only pretraining Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com> # message auto-generated for no-merge-commit merge: !2261 merge pretrain_data_preprocess into master feat(torch):Support text-only pretraining Created-by: yaoyaoxu Commit-by: yaoyaoxu Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1.支持纯fsdp的纯文本预训练 2.支持megatron+fsdp双后端的纯文本预训练 3.提供预训练特性文档 ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!22612 个月前
[bugfix] fix bugs for wan2.2&qwen3vl fsdp checkpointing Co-authored-by: peng-hengduo<penghengduo@huawei.com> # message auto-generated for no-merge-commit merge: !2180 merge wan_checkpointing_bugfix into master [bugfix] fix bugs for wan2.2&qwen3vl fsdp checkpointing Created-by: peng-hengduo Commit-by: peng-hengduo Merged-by: ascend-robot Description: Fix the bugs of wan2.2 qwen3vl breakpointing. See merge request: Ascend/MindSpeed-MM!21803 个月前
[BugFix] Resolve the conflict between the data spliting operation in dynamic batching dataloader and certain data types output by the dataloader for qwen3vl Co-authored-by: huangdabiao<huangdabiao1@huawei.com> # message auto-generated for no-merge-commit merge: !2138 merge dynamic_batching_totest_0202 into master [BugFix] Resolve the conflict between the data spliting operation in dynamic batching dataloader and certain data types output by the dataloader for qwen3vl Created-by: huangdabiao Commit-by: huangdabiao Merged-by: ascend-robot Description: ## Motivation bug产生的原因: dynamic batching dataloader中需要将micro_batch(由原有dataloader得到)拆分成单条数据,以将对应的数据拼接成符合长度的数据。 qwen3 vl dataloader获取的batch中新增的“use_audio_in_video”数据为bool类型,与dynamic batching dataloader中的数据拆分逻辑(micro_batch拆分成mbs个数据)冲突 修复合理性: 本次pr将非tensor类型的数据单独保存,并在数据packing后将这些变量放回,避免迭代处理该数据。 ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!21383 个月前
!1433 [Bugfix] fix sora dpo multi-parameter PP for core0.12.1 Merge pull request !1433 from meng-coding/master 9 个月前