| [Bugfix] bugfix for clip grad & empty ep
Co-authored-by: htwang<wanghaitao60@huawei.com>
# message auto-generated for no-merge-commit merge:
!2382 merge 26.0.0 into 26.0.0
[Bugfix] bugfix for clip grad & empty ep
Created-by: htwang
Commit-by: htwang
Merged-by: ascend-robot
Description: ## What this PR does / why we need it?
1、EP使能时,当部分ep rank没有收到tokens时,保持空运算,防止专家参数失去梯度
2、修复不开EP切clip grad norm大于0时,clip grad 计算错误的问题
## Does this PR introduce any user-facing change?
Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path.
## How was this patch tested?
Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations.
See merge request: Ascend/MindSpeed-MM!2382 | 1 个月前 |