文件最后提交记录最后更新时间
[BugFix] Fix the bug in GBS data balance where state_buffer lacks the key "non_balanced_data". Co-authored-by: huangdabiao<huangdabiao1@huawei.com> # message auto-generated for no-merge-commit merge: !2232 merge gbs_data_balance_bugfix_0302 into master [BugFix] Fix the bug in GBS data balance where state_buffer lacks the key "non_balanced_data". Created-by: huangdabiao Commit-by: huangdabiao Merged-by: ascend-robot Description: ## Motivation bug产生的原因: self.state_buffer[data_type]["non_balanced_data"]在batch中含有不可迭代对象时初始化为字典并存储不可迭代对象, 并在存储之后通过 for data_name in self.state_buffer[data_type]["non_balanced_data"]: batch.pop(data_name) 将不可迭代对象从batch中暂时删除,以避免后续均衡过程中产生数据不可拆分的bug。 但是在batch不含有不可迭代对象的情况下,self.state_buffer[data_type]["non_balanced_data"]未被初始化,造成self.state_buffer[data_type]找不到key"non_balanced_data"的bug 修复合理性: 本次pr添加判断条件,仅在存在key"non_balanced_data"的情况下调用self.state_buffer[data_type]["non_balanced_data"],规避找不到key的错误 ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!22322 个月前
feat(torch): hunyuanvideo 1.5 i2v adapt Co-authored-by: Miss_min<qiaoxiaomin@huawei.com> # message auto-generated for no-merge-commit merge: !2221 merge master into master feat(torch): hunyuanvideo 1.5 i2v adapt Created-by: Miss_min Commit-by: Miss_min Merged-by: ascend-robot Description: ## Motivation hunyuanvideo 1.5 i2v 适配 ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!22212 个月前
[Feature]CP-reconstruct for hetero-parallel Co-authored-by: mazhuang<mazhuang21@huawei.com> # message auto-generated for no-merge-commit merge: !2001 merge CP-re-construct into master [Feature]CP-reconstruct for hetero-parallel Created-by: mazhuang1234 Commit-by: mazhuang Merged-by: ascend-robot Description: ## Motivation [Feature]CP-reconstruct for hetero-parallel ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ![image.png](https://raw.gitcode.com/user-images/assets/7404510/d7e89876-6492-4003-8282-c256077c1331/image.png 'image.png') ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!20014 个月前
!1297 [Bugfix]Security Update: Remove OS Info Print, Defend Formula Injection, Add Security Utils, Disable 0.0.0.0 and add public IP Merge pull request !1297 from zhangxubin/master 10 个月前
feat(torch): Qwen3-Omni support ulysses cp / fix(torch): repeat_kv and activation_offload bug Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com> # message auto-generated for no-merge-commit merge: !2188 merge qwen3omni_ulysses_cp into master feat(torch): Qwen3-Omni support ulysses cp / fix(torch): repeat_kv and activation_offload bug Created-by: yaoyaoxu Commit-by: yaoyaoxu Merged-by: ascend-robot Description: ## Motivation 当前序列长度无法支持128K,需要适配CP以支持128K长序列 ## Modification 1.Qwen3-Omni支持ulysses cp:对AuT、ViT、LLM都做了适配; 如果开启CP,但没有传入音频数据或CP size > = seq_len,则不对音频模块做CP处理 2.修复repeat_kv的bug 3.修复开启activation_offload配置时的内存泄漏bug ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!21882 个月前
MM disttrain intervl2 适配自动并行搜索 Co-authored-by: gcw_amOUPDs9<fuyuefeng@huawei.com> # message auto-generated for no-merge-commit merge: !1588 merge master into master MM disttrain intervl2 适配自动并行搜索 Created-by: gcw_amOUPDs9 Commit-by: gcw_amOUPDs9 Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!15886 个月前
[Docs] Annotation Standardization Rectification Co-authored-by: LKONE<wanglikai4@huawei.com> # message auto-generated for no-merge-commit merge: !2010 merge master into master [Docs] Annotation Standardization Rectification Created-by: wanglikai1019 Commit-by: LKONE Merged-by: ascend-robot Description: ## Motivation Rectify the annotations for the code repository in accordance with the annotation specification requirements. ## Modification 1. Replace Chinese annotations with English ones; 2. Organize the environment variable documentation and add explanations in the startup scripts (currently implemented on Qwen3VL and Wan2.2); 3. Add annotations to the public dataset functions and public model classes of Wan2.2 and Qwen3VL; 4. Adjust the annotation format (e.g., number of indentations). ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!20104 个月前
!587 【特性】VAE添加判别器以及数据集等模块 Merge pull request !587 from zs-Derrick/master 1 年前
[bugfix] fix bug for qwenvl2.5 model with hetero parallel Co-authored-by: xiaoyue994<xiaoyuanhang@huawei.com> # message auto-generated for no-merge-commit merge: !2213 merge master into master [bugfix] fix bug for qwenvl2.5 model with hetero parallel Created-by: xiaoyue994 Commit-by: xiaoyue994 Merged-by: ascend-robot Description: ## Motivation The vl model lacks an audio module, resulting in a key error. ![image.png](https://raw.gitcode.com/user-images/assets/7404510/1b93768e-7820-44c3-b454-28c14d5fbafd/image.png 'image.png') ## Modification Skip for loop when the audio module parameter is None for vl model. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!22133 个月前
!259 【特性】OpenSoraPlan 1.3 添加 I2V 训练 Merge pull request !259 from zzztq/master 1 年前
!1492 [Modify] Update deterministic computation Merge pull request !1492 from 王泽/deter_comp 8 个月前
[mindspore] support qwen3vl finetune for megatron backend Co-authored-by: ffmh<fengminghao2@huawei.com> # message auto-generated for no-merge-commit merge: !1785 merge qwen3vl_megatron into master [mindspore] support qwen3vl finetune for megatron backend Created-by: ffmh Commit-by: ffmh Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. 适配 qwen3vl 8B/30B megatron后端 ## Modification Please briefly describe what modification is made in this PR. 添加 layer_spec修改、text模块transformer block/layer添加deepstack相关适配 ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!17856 个月前
[Feature]Add split&gather function for ViT RingAttention, bugfix for qwen3vl Co-authored-by: zs-derrick1<1434012475@qq.com> # message auto-generated for no-merge-commit merge: !1824 merge ring into master [Feature]Add split&gather function for ViT RingAttention, bugfix for qwen3vl Created-by: zs-derrick1 Commit-by: zs-derrick1 Merged-by: ascend-robot Description: ## Motivation Add split&gather function for ViT RingAttention, bugfix for qwen3vl ## Modification Add split&gather function for ViT RingAttention Bugfix for qwen3vl ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!18245 个月前