| [pytorch][bugfix] fixes the accuracy bug for the TND. | 4 个月前 |
| fix(pytorch):Ensure no PP/VPP stage contains only empty layers during LoRA fine-tuning | 2 个月前 |
| [pytorch][bugfix]fix about cleancode | 8 个月前 |
| [pytorch][bugfix]fix about cleancode | 8 个月前 |
| [mindspore][bugfix]adapt qwen3_235b for mindspore | 6 个月前 |
| [pytorch][bugfix]fix deepseek3 tnd bug in mbs > 1 | 3 个月前 |
| fix(pytorch):Ensure no PP/VPP stage contains only empty layers during LoRA fine-tuning | 2 个月前 |
| !3163 [pytorch][feature]switch megatron_adaptor to v2 | 9 个月前 |