| feat(pytorch): Add MindSpeed Muon feature
Co-authored-by: HanhuiChen <chenhanhui1@h-partners.com>
# message auto-generated for no-merge-commit merge:
!4549 merge master into master
feat(pytorch): Add MindSpeed Muon feature
Created-by: HANHU1CHEN
Commit-by: HanhuiChen
Merged-by: ascend-robot
Description:
## What this PR does / why we need it?
Replaces the in-repo self-maintained Muon optimizer with MindSpeed's native Muon implementation, removing the legacy code and adapting the patch registration accordingly.
## Does this PR introduce any user-facing change?
No change to the Muon usage interface; existing Muon training scripts and arguments continue to work. The underlying implementation is switched to MindSpeed's native version.
## How was this patch tested?
Precision was verified by aligning training with the native Muon optimizer against the previous self-maintained implementation; loss and grad-norm behavior were consistent.
See merge request: Ascend/MindSpeed-LLM!4549 | 13 hours ago |