| fix(pytorch):add ckpt-format argument to scripts | 2 个月前 |
| test(torch): add deepseekv4 st | 13 天前 |
| fix(pytorch):add ckpt-format argument to scripts | 2 个月前 |
| fix(pytorch):add ckpt-format argument to scripts | 2 个月前 |
| fix(pytorch):add ckpt-format argument to scripts | 2 个月前 |
| test(python): update dpo st | 1 天前 |
| fix(pytorch):add ckpt-format argument to scripts | 2 个月前 |
| feat(pytorch):ring cp support MLA/GQA | 1 个月前 |
| [pytorch][ci]Add pretrain and finetune ST for FSDP2 backend | 3 个月前 |
| test(megatron):replace chatglm3_gqa_cp4 testcase with qwen3-8b | 2 天前 |
| test(megatron): replace llama2_tp2_cp4_general_double_ring testcase with qwen3-8b | 1 个月前 |
| feat: change ST | 6 天前 |
| feat(torch):add st | 16 天前 |
| feat: change ST's llama2_tp4pp2vpp2_tp2d_tpx2tpy2.sh to qwen3_8b_tp2_pp4_vpp2.sh | 2 天前 |
| test(megatron): replace llama2_tp2_pp4_vpp2_swap testcase with qwen3-8b | 2 天前 |
| fix(pytorch):add ckpt-format argument to scripts | 2 个月前 |
| test(pytorch): add st for AscendC GDN kernels, including cp and pack. | 1 天前 |
| test(pytorch): add st for AscendC GDN kernels, including cp and pack. | 1 天前 |
| docs(megatron):update testcase develop rule | 2 个月前 |
| [pytorch][ci]Add pretrain and finetune ST for FSDP2 backend | 3 个月前 |
| test(megatron): replace_llama3_lora_with_qwen3_8b | 30 天前 |
| test(megatron): replace tune_llama2_tp1_pp1_qlora testcase with qwen3-8b | 29 天前 |