| feat(pytorch): add dsv4 mg2hf
Co-authored-by: qyzqyz<quyueze@h-partners.com>
# message auto-generated for no-merge-commit merge:
!4458 merge master into master
feat(pytorch): add dsv4 mg2hf
Created-by: qyzqyz
Commit-by: qyzqyz
Merged-by: ascend-robot
Description:
## What this PR does / why we need it?
1. add dsv4 mg2hf
- only support pp
- only support etp = 1 or tp = 1
2. fix dsv4 hf2mg vpp
## Does this PR introduce any user-facing change?
if use base model of dsv4 to do mg2hf convert, please set --model-type-hf with deepseek4_base
## How was this patch tested?
Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations.
See merge request: Ascend/MindSpeed-LLM!4458 | 19 天前 |