Star181
225
代码介绍
代码
Issues66
Pull Requests54
流水线
Actions
讨论
Wiki
项目成员54
分析
项目设置
Star181
225
  1. MindSpeed-LLM
  2. /
  3. tests
  4. /
  5. poc
  6. /
  7. qwen3-moe
ascend-robotascend-robotfix(pytorch):add ckpt-format argument to scripts
251f267c创建于 4月3日历史提交
文件最后提交记录最后更新时间
pretrain_qwen3_235b_a22b_4k_A2_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
pretrain_qwen3_235b_a22b_4k_A3_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
pretrain_qwen3_235b_a22b_8K_A2_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
pretrain_qwen3_30b_a3b_32K_A2_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
pretrain_qwen3_30b_a3b_4K_A2_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
pretrain_qwen3_30b_a3b_4K_A3_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
tune_qwen3_30b_a3b_256K_full_pack_A3_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
tune_qwen3_30b_a3b_32K_full_pack_A+X_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前
tune_qwen3_30b_a3b_32K_full_pack_A3_ptd.sh
fix(pytorch):add ckpt-format argument to scripts2 个月前