MindSpeed-LLM / examples / mcore / qwen3_next (2.2.0)
Latest commit: ad32b3b5 by ascend-robot, created on 2025-10-10
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false
| File | Last commit | Last updated |
| --- | --- | --- |
| ckpt_convert_qwen3_next_80b_hf2mcore.sh | [pytorch][ckpt]add ckpt mtp and mg2hf of qwen3-next Merge pull request !3350 from jwhk/master | 8 months ago |
| ckpt_convert_qwen3_next_80b_mcore2hf.sh | [pytorch][ckpt]add ckpt mtp and mg2hf of qwen3-next Merge pull request !3350 from jwhk/master | 8 months ago |
| data_convert_qwen3_next_instruction.sh | [pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> | 7 months ago |
| data_convert_qwen3_next_pretrain.sh | [pytorch][sh]add qwen3-next sh Merge pull request !3328 from guozhihua/qwen3_next_sh | 8 months ago |
| evaluate_qwen3_next_80b_ptd.sh | [pytorch][sh]add qwen3_next tune sh and update other sh Merge pull request !3339 from guozhihua/qwen3_next_sh_fix | 8 months ago |
| genarate_qwen3_next_80b_ptd.sh | [pytorch][sh]add qwen3_next tune sh and update other sh Merge pull request !3339 from guozhihua/qwen3_next_sh_fix | 8 months ago |
| pretrain_qwen3_next_80b_4K_A3_ptd.sh | [pytorch][sh]add qwen3_next tune sh and update other sh Merge pull request !3339 from guozhihua/qwen3_next_sh_fix | 8 months ago |
| tune_qwen3_next_80b_4K_full_ptd.sh | [pytorch][sh]add qwen3_next tune sh and update other sh Merge pull request !3339 from guozhihua/qwen3_next_sh_fix | 8 months ago |