| feat(torch): Initialize Qwen3-Omni model code from transformers library for pure fsdp2
Co-authored-by: liuxi_ <liuxi75@huawei.com>
# message auto-generated for no-merge-commit merge:
!2307 merge qwen3_omni_moe into master
feat(torch): Initialize Qwen3-Omni model code from transformers library for pure fsdp2
Created-by: liuxi_
Commit-by: liuxi_
Merged-by: ascend-robot
Description: Motivation
Initialize the Qwen3-Omni model code, modeling_qwen3_omni_moe.py. The code in this PR comes from the transformers library.
Modification
1. Change: import modeling_qwen3_omni_moe.py from the qwen3omni model in the transformers repository into mindspeed_mm\fsdp\models\qwen3omni\modeling_qwen3_omni_moe.py, and apply an initial trim.
2. Verified: the build passes.
Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.
BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.
Checklist
Before PR:
The new code needs to comply with the Clean Code specification.
The PR content is self-checked; the wording is clear and the writing follows the standard.
After PR:
CLA has been signed and all committers have signed the CLA in this PR.
The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!2307 | 2 months ago |