| [Modify]Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion
Co-authored-by: meng-coding<wumengjie6@huawei.com>
Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com>
# message auto-generated for no-merge-commit merge:
!1898 merge pick_master_ckpt_to_230 into 2.3.0
[Modify]Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion
Created-by: yaoyaoxu
Commit-by: yaoyaoxu;meng-coding
Merged-by: ascend-robot
Description: ## Motivation
Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion
## Modification
Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion
## Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.
## BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1898 | 5 个月前 |