| feat(pytorch): support different TP configuration on edge and cloud for layerwise_disaggregated_training | 1 month ago |
| feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| test(megatron): pipeline ut testcase fix | 9 days ago |
| feat(pytorch): support different TP configuration on edge and cloud for layerwise_disaggregated_training | 1 month ago |
| feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| fix: LDT training supports qwen25-72b model | 1 day ago |
| fix: LDT training supports qwen25-72b model | 1 day ago |
| feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |