| [Feature] qwen3.5 support new dcp convert and fused rmsnorm
Co-authored-by: young256 <liumingyang16@huawei.com>
# message auto-generated for no-merge-commit merge:
!2246 merge fsdp2_dev_qwen35_publish into master
[Feature] qwen3.5 support new dcp convert and fused rmsnorm
Created-by: young256
Commit-by: young256
Merged-by: ascend-robot
Description:
## What this PR does / why we need it?
Qwen3.5 needs to support the new DCP weight conversion, which reduces the vector-operator overhead caused by transposing weights at runtime. In addition, RMSNorm and RMSNormGate are replaced with fused operators to eliminate the separate mul vector operators they otherwise introduce.
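The two optimizations described above can be illustrated with a minimal plain-Python sketch. It is not the code in this PR: the function names are hypothetical, the "fused" function only stands in for a single fused kernel, and the real change uses Ascend operators and DCP checkpoints rather than Python lists.

```python
import math

def rms_norm_unfused(x, weight, eps=1e-6):
    """Reference RMSNorm written as separate steps (hypothetical helper)."""
    mean_sq = sum(v * v for v in x) / len(x)       # reduction
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)       # reciprocal RMS
    normed = [v * inv_rms for v in x]              # elementwise mul no. 1
    return [n * w for n, w in zip(normed, weight)] # elementwise mul no. 2

def rms_norm_fused(x, weight, eps=1e-6):
    """Stand-in for a fused kernel: one pass, no intermediate tensor."""
    inv_rms = 1.0 / math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]

def transpose(m):
    """Transpose a matrix stored as a list of rows."""
    return [list(row) for row in zip(*m)]

def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def forward_with_runtime_transpose(w_stored, x):
    """Transposes the stored weight on every call (the overhead to avoid)."""
    return matvec(transpose(w_stored), x)

def convert_weight(w_stored):
    """Transposes once, at checkpoint-conversion time (assumed behavior)."""
    return transpose(w_stored)

def forward_with_converted_weight(w_converted, x):
    """Uses the pre-transposed weight directly; no per-step transpose."""
    return matvec(w_converted, x)
```

Both pairs of functions compute the same result; the difference is only in how many separate elementwise passes and transposes are executed per step, which is the cost this PR targets.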
## Does this PR introduce any user-facing change?
Yes. The DCP weight conversion instructions have changed.
## How was this patch tested?
The converted weights passed verification after conversion. The fused operator replacement passed precision alignment.
See merge request: Ascend/MindSpeed-MM!2246 | 2 months ago |