文件最后提交记录最后更新时间
[Refactor]Decouple Qwen3-Omni's reuse logic, adapt dedicated DataCollator Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com> # message auto-generated for no-merge-commit merge: !2151 merge qwen3omni_collator into master [Refactor]Decouple Qwen3-Omni's reuse logic, adapt dedicated DataCollator Created-by: yaoyaoxu Commit-by: yaoyaoxu Merged-by: ascend-robot Description: ## Motivation 由于当前仓上没有其他模型需要音频相关特征,单独实现qwen3omni的DataCollator,与其他模型解耦 ## Modification 1.修改点:新增名为qwen3omni的DATA_COLLATOR,继承DataCollatorForQwen2vl,增加音频特征 2.已验证:改写后能获取到音频特征 ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!21513 个月前
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261614 小时前
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261614 小时前
[Docs] Annotation Standardization Rectification Co-authored-by: LKONE<wanglikai4@huawei.com> # message auto-generated for no-merge-commit merge: !2010 merge master into master [Docs] Annotation Standardization Rectification Created-by: wanglikai1019 Commit-by: LKONE Merged-by: ascend-robot Description: ## Motivation Rectify the annotations for the code repository in accordance with the annotation specification requirements. ## Modification 1. Replace Chinese annotations with English ones; 2. Organize the environment variable documentation and add explanations in the startup scripts (currently implemented on Qwen3VL and Wan2.2); 3. Add annotations to the public dataset functions and public model classes of Wan2.2 and Qwen3VL; 4. Adjust the annotation format (e.g., number of indentations). ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!20104 个月前
fix: 修复多个关键bug和安全隐患 Co-authored-by: 王姜奔<wangjiangben@huawei.com> # message auto-generated for no-merge-commit merge: !2340 merge master into master fix: 修复多个关键bug和安全隐患 Created-by: wangjiangben Commit-by: 王姜奔 Merged-by: ascend-robot Description: ## 修复内容 本次提交修复了MindSpeed-MM代码仓库中的多个关键bug和安全隐患: ### 1. 修复共享内存资源泄漏问题 - 文件: mindspeed_mm/data/data_utils/utils.py - 问题: 共享内存创建后未调用 unlink() 释放,导致内存泄漏 - 修复: 添加 try-finally 块确保资源正确释放 ### 2. 修复除零风险问题 - 文件: mindspeed_mm/models/vlm_model.py - 问题: 损失计算时 token_nums 可能为零导致除零错误 - 修复: 使用 torch.clamp 和零值检查防止除零 ### 3. 修复代码注入安全隐患 - 文件: mindspeed_mm/tasks/rl/soragrpo/utils/fsdp_util.py - 问题: 使用 eval() 函数存在代码注入风险 - 修复: 使用安全的字符串解析方式处理分数 ### 4. 改进异常处理 - 文件: mindspeed_mm/data/data_utils/func_utils/mm_plugin.py - 问题: 异常捕获过于宽泛,影响调试 - 修复: 细化异常类型,添加错误日志 ## 测试 - ✅ 代码已通过本地测试 - ✅ 不影响现有功能 - ✅ 提高了代码安全性和稳定性 ## 修改统计 - 修改文件: 4个 - 新增代码: 48行 - 删除代码: 22行 See merge request: Ascend/MindSpeed-MM!23401 个月前
[Feature] Add Magistral3 Co-authored-by: wanghao<wanghao773@h-partners.com> # message auto-generated for no-merge-commit merge: !2066 merge data_mastral into master [Feature] Add Magistral3 Created-by: vectorwhh Commit-by: wanghao Merged-by: ascend-robot Description: 【Feature】Add Magistral3 使用fsdp2结构适配,数据模块迁移自llama factory See merge request: Ascend/MindSpeed-MM!20664 个月前
feat(torch):Support text-only pretraining Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com> # message auto-generated for no-merge-commit merge: !2261 merge pretrain_data_preprocess into master feat(torch):Support text-only pretraining Created-by: yaoyaoxu Commit-by: yaoyaoxu Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1.支持纯fsdp的纯文本预训练 2.支持megatron+fsdp双后端的纯文本预训练 3.提供预训练特性文档 ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!22612 个月前
[Feature] Add Magistral3 Co-authored-by: wanghao<wanghao773@h-partners.com> # message auto-generated for no-merge-commit merge: !2066 merge data_mastral into master [Feature] Add Magistral3 Created-by: vectorwhh Commit-by: wanghao Merged-by: ascend-robot Description: 【Feature】Add Magistral3 使用fsdp2结构适配,数据模块迁移自llama factory See merge request: Ascend/MindSpeed-MM!20664 个月前