文件最后提交记录最后更新时间
fix(pytorch):add ckpt-format argument to scripts2 个月前
test(torch): add deepseekv4 st13 天前
fix(pytorch):add ckpt-format argument to scripts2 个月前
fix(pytorch):add ckpt-format argument to scripts2 个月前
fix(pytorch):add ckpt-format argument to scripts2 个月前
test(python): update dpo st1 天前
fix(pytorch):add ckpt-format argument to scripts2 个月前
feat(pytorch):ring cp support MLA/GQA1 个月前
[pytorch][ci]Add pretrain and finetune ST for FSDP2 backend3 个月前
test(megatron):replace chatglm3_gqa_cp4 testcase with qwen3-8b2 天前
test(megatron): replace llama2_tp2_cp4_general_double_ring testcase with qwen3-8b1 个月前
feat: change ST6 天前
feat(torch):add st16 天前
feat: change ST's llama2_tp4pp2vpp2_tp2d_tpx2tpy2.sh to qwen3_8b_tp2_pp4_vpp2.sh2 天前
test(megatron): replace llama2_tp2_pp4_vpp2_swap testcase with qwen3-8b2 天前
fix(pytorch):add ckpt-format argument to scripts2 个月前
test(pytorch): add st for AscendC GDN kernels, including cp and pack.1 天前
test(pytorch): add st for AscendC GDN kernels, including cp and pack.1 天前
docs(megatron):update testcase develop rule2 个月前
[pytorch][ci]Add pretrain and finetune ST for FSDP2 backend3 个月前
test(megatron): replace_llama3_lora_with_qwen3_8b30 天前
test(megatron): replace tune_llama2_tp1_pp1_qlora testcase with qwen3-8b29 天前