| [pytorch][bugfix]fix -mla-mm-split in ckpt | 4 个月前 |
| !2569 Refact: embedding and rotary embedding | 1 年前 |
| !2710 Compatibility fixes | 1 年前 |
| [pytorch][bugfix]fix about cleancode | 8 个月前 |
| !2712 [pytorch][feature]upgrading Megatron to r0.12.1 | 11 个月前 |
| [pytorch][bugfix]fix lora_target_modules in ckpt save_lora_to_hf | 4 个月前 |
| [pytorch][bugfix]fix deepseek3 tnd bug in mbs > 1 | 3 个月前 |
| [pytorch][bugfix] fix variable in lora moe | 4 个月前 |
| !3355 [pytroch][bugfix] fix bug when using GeneralPretrainHandler to process shareGPT-style datasets | 8 个月前 |
| !1958 整改仓库文件结构 | 1 年前 |
| !1998 rename: repo package name from modellink to mindspeed_llm | 1 年前 |
| !3100 [pytorch][feature]mindio_tft adapt reusefp32 and megatron_adaptor_v2 | 9 个月前 |
| !3100 [pytorch][feature]mindio_tft adapt reusefp32 and megatron_adaptor_v2 | 9 个月前 |