File | Last commit | Last updated
[Bugfix] Fix qwen3vl DCP-to-HF conversion bug
Co-authored-by: yangx_sy <sunyang49@huawei.com>
# message auto-generated for no-merge-commit merge: !1766 merge qwen3vl_readme into master
[Bugfix] fix bug when qwen3vl dcp to hf
Created-by: yangx_sy
Commit-by: yangx_sy
Merged-by: ascend-robot
Description:
## Checklist
**Before PR**:
- [x] The new code needs to comply with the Clean Code specification.
- [x] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [x] CLA has been signed and all committers have signed the CLA in this PR.
- [x] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1766
6 months ago
[Feature] VACE-Wan2.2 Support
Co-authored-by: feng0w0 <houyufeng4@huawei.com>
# message auto-generated for no-merge-commit merge: !1719 merge master into master
[Feature]VACE-Wan2.2 Support
Created-by: feng0w0
Commit-by: feng0w0
Merged-by: ascend-robot
Description:
## Modification
Support VACE Wan2.2 and inference.
## Self-test (Optional)
- Update VACE readme.
- Add VACE Wan2.2 training scripts.
- Add VACE inference script.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1719
6 months ago
[bugfix] Fix num_experts lookup error in mm_2_hf weight conversion
Co-authored-by: chengpeng25 <chengpeng9@huawei.com>
# message auto-generated for no-merge-commit merge: !1918 merge dev_cp_fix_230 into 2.3.0
[bugfix] fix get num_experts error in mm_2_hf weight conversion
Created-by: chengpeng25
Commit-by: chengpeng25
Merged-by: ascend-robot
Description:
## Motivation
Fix the failure in mm_2_hf weight conversion where reading num_experts for the InternVL model fails because the corresponding attribute does not exist.
issue: https://gitcode.com/Ascend/MindSpeed-MM/issues/186
## Modification
Check whether the config contains 'text_config'; if it does not, use the default value 1.
## Self-test (Optional)
After the change, both hf_2_mm and mm_2_hf run normally for the InternVL3-8B model.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1918
5 months ago
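The fallback described in merge request !1918 (use 1 when the config has no 'text_config') might look roughly like the sketch below. This is an illustration only, not the repository's code: the helper name `get_num_experts`, its signature, and the `SimpleNamespace` stand-in configs are assumptions, as is the extra fallback when `text_config` exists but lacks `num_experts`.

```python
from types import SimpleNamespace


def get_num_experts(config, default=1):
    """Return num_experts from config.text_config, or `default` if absent.

    Hypothetical helper: the name and signature are illustrative only.
    """
    text_config = getattr(config, "text_config", None)
    if text_config is None:
        # The config has no text_config: use the default of 1.
        return default
    # Assumption: also fall back when text_config has no num_experts.
    return getattr(text_config, "num_experts", default)


# Stand-in configs for illustration (not real model configs).
moe_config = SimpleNamespace(text_config=SimpleNamespace(num_experts=8))
no_text_config = SimpleNamespace()

print(get_num_experts(moe_config))
print(get_num_experts(no_text_config))
```

Using `getattr` with a default keeps the lookup from raising `AttributeError` on configs that do not carry a `text_config`.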
[mindspore] ckpt convert for qwen3vl between hf and megatron format
Co-authored-by: chengpeng25 <chengpeng9@huawei.com>
# message auto-generated for no-merge-commit merge: !1765 merge dev_convert_ckpt into master
[mindspore] ckpt convert for qwen3vl between hf and megatron format
Created-by: chengpeng25
Commit-by: chengpeng25
Merged-by: ascend-robot
Description:
## Motivation
Checkpoint converter between Hugging Face and Megatron formats for qwen3vl.
## Modification
qwen3vl checkpoint converter between HF and Megatron, covering both dense and MoE models.
## Checklist
**Before PR**:
- [x] The new code needs to comply with the Clean Code specification.
- [x] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [x] CLA has been signed and all committers have signed the CLA in this PR.
- [x] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1765
6 months ago
[feature] Support MoE expert-merge weight conversion from HF to DCP and vice versa
Co-authored-by: pjgao <gaopengju3@huawei.com>
# message auto-generated for no-merge-commit merge: !1752 merge master into master
[feature]support moe expert merge weight convert from hf to dcp or vice versa
Created-by: PIPIXIU
Commit-by: pjgao
Merged-by: ascend-robot
Description:
## Motivation
InternVL3.5 uses merged MoE experts, which requires weight conversion. The repository currently converts in two stages: it first converts the original repository's safetensors into expert-merged safetensors, then converts those into torch DCP. This PR therefore adds the conversion interfaces `mm-convert ExpertMergeDcpConverter hf_to_dcp` and `mm-convert ExpertMergeDcpConverter dcp_to_hf`.
## Modification
1. Add the converter class ExpertMergeDcpConverter.
2. Abstract a Commandable class that manages all command-line interfaces and provides an add_command interface for adding functions to the command line.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1752
6 months ago
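To illustrate the idea of a `Commandable` that exposes functions on the command line through `add_command`, here is a minimal sketch built on the standard-library `argparse`. Only the names `Commandable` and `add_command` come from the merge request; the constructor, the `run` method, the positional-argument handling, and the toy `hf_to_dcp` function are assumptions and do not reflect the actual MindSpeed-MM implementation.

```python
import argparse


class Commandable:
    """Hypothetical registry that exposes plain functions as subcommands."""

    def __init__(self, prog="demo-cli"):
        self._parser = argparse.ArgumentParser(prog=prog)
        self._subparsers = self._parser.add_subparsers(dest="command", required=True)
        self._commands = {}

    def add_command(self, func, name=None):
        """Register `func` as a subcommand and return it unchanged."""
        name = name or func.__name__
        sub = self._subparsers.add_parser(name, help=func.__doc__)
        # Assumption: every command takes only positional string arguments.
        sub.add_argument("args", nargs="*")
        self._commands[name] = func
        return func

    def run(self, argv):
        """Parse `argv` and call the selected function with its arguments."""
        ns = self._parser.parse_args(argv)
        return self._commands[ns.command](*ns.args)


cli = Commandable()


def hf_to_dcp(src, dst):
    """Toy stand-in for a conversion function; it only formats a message."""
    return f"hf_to_dcp: {src} -> {dst}"


cli.add_command(hf_to_dcp)
print(cli.run(["hf_to_dcp", "in_dir", "out_dir"]))
```

Because `add_command` returns the function it registers, it can also be applied as a decorator on a bound instance (`@cli.add_command`).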