| [refactor] Merge npu_expert_parallel into npu_permute | 14 天前 |
| [fix] preserve MoE w13 values when exporting HF weights | 1 天前 |
| [feat] DT support scripts | 2 个月前 |
| Fixed license headers | 1 个月前 |
| [fix] support swap muon optimizer | 9 天前 |
| [fix] 修复 swap optimizer checkpoint 保存加载 | 22 天前 |
| [feat] DT support scripts | 2 个月前 |