Fork
0
代码
介绍
代码
Issues
Pull Requests
流水线
Actions
讨论
Wiki
项目成员
分析
项目设置
Fork
0
2.2.0
MindSpeed-LLM
/
examples
/
mcore
/
deepseek_r1_distill_qwen
下载当前目录
I
i-robot
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS
945aad92
创建于
2025年6月19日
历史提交
文件
最后提交记录
最后更新时间
ckpt_convert_distill_qwen_hf2mcore.sh
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
ckpt_convert_distill_qwen_mcore2hf.sh
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
data_convert_distill_qwen_instruction.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
generate_distill_qwen_14b.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
generate_distill_qwen_1point5b.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
generate_distill_qwen_32b.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
generate_distill_qwen_7b.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
tune_distill_qwen_14b_full.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
tune_distill_qwen_1point5b_full.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前
tune_distill_qwen_32b_full.sh
!2815
[pytorch][bugfix] fix logs bug Merge pull request
!2815
from jzh/master_a
11 个月前
tune_distill_qwen_7b_full.sh
!2200
add template deepseek3 Merge pull request
!2200
from wucong/add_distill_qwen
1 年前