MindSpeed-LLM/tests/pipeline/ut/posttrain/ldt_sft
Latest commit: ascend-robot, "fix: LDT training supports qwen25-72b model" (309cbccf, created 1 day ago)
| File | Last commit | Last updated |
| --- | --- | --- |
| test_convert_ckpt_pp_vpp.py | feat(pytorch): support different TP configuration on edge and cloud for layerwise_disaggregated_training | 1 month ago |
| test_core_utils.py | feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| test_distributed_data_parallel.py | feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| test_initialize.py | test(megatron): pipeline ut testcase fix | 9 days ago |
| test_ldt_sft_trainer.py | feat(pytorch): support different TP configuration on edge and cloud for layerwise_disaggregated_training | 1 month ago |
| test_parallel_state.py | feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| test_recompute_adaptor.py | fix: LDT training supports qwen25-72b model | 1 day ago |
| test_recompute_common.py | fix: LDT training supports qwen25-72b model | 1 day ago |
| test_training.py | feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |
| test_utils.py | feat(pytorch): support different DP or DP&TP configuration on edge and cloud for layerwise_disaggregated_training | 22 days ago |