| feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training
Co-authored-by: yanzhenghao<yanzhenghao2@huawei.com>
Co-authored-by: xuguoliang3<xuguoliang3@huawei.com>
Co-authored-by: fangminghao<fangminghao@huawei.com>
# message auto-generated for no-merge-commit merge:
!4437 merge 20260409_vdp into master
feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training
Created-by: xuguoliang3
Commit-by: xuguoliang3;yanzhenghao;fangminghao
Merged-by: ascend-robot
Description:
## What this PR does / why we need it?
support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training
## Does this PR introduce any user-facing change?
Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path.
## How was this patch tested?
Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations.
See merge request: Ascend/MindSpeed-LLM!4437 | 14 天前 |