| [feature]support moe expert merge weight convert from hf to dcp or vice versa
Co-authored-by: pjgao<gaopengju3@huawei.com>
# message auto-generated for no-merge-commit merge:
!1752 merge master into master
[feature]support moe expert merge weight convert from hf to dcp or vice versa
Created-by: PIPIXIU
Commit-by: pjgao
Merged-by: ascend-robot
Description: ## Motivation
InternVL3.5使用MOE专家合并的方式需要权重转换,当前仓上是两段转换,先将原仓safetensors转换成合并专家后的safetensors,再转换成torch dcp,因此本PR新增转换接口mm-convert ExpertMergeDcpConverter hf_to_dcp /mm-convert ExpertMergeDcpConverter dcp_to_hf
## Modification
1. 增加转换类ExpertMergeDcpConverter
2. 抽象Commandable管理所有的命令行接口,提供add_command接口,允许增加函数到命令行中
## Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.
## BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!1752 | 6 个月前 |