Fork
0
代码
介绍
代码
Issues
Pull Requests
流水线
Actions
讨论
Wiki
项目成员
分析
项目设置
Fork
0
2.2.0
MindSpeed-LLM
/
examples
/
mcore
下载当前目录
ascend-robot
[pytorch][bugfix]fix script of glm45-moe
16cc49ce
创建于
2月2日
历史提交
文件
最后提交记录
最后更新时间
baichuan2
[pytorch][bugfix] baichaun2 no-fa-adapt Co-authored-by: jzh6229<jiangzhihui4@huawei.com>
7 个月前
chatglm3
!3110
[pytorch][sh]add CUDA_DEVICE_MAX_CONNECTION in convert_ckpt scripts Merge pull request
!3110
from sunjunjie/master
9 个月前
codellama
!3110
[pytorch][sh]add CUDA_DEVICE_MAX_CONNECTION in convert_ckpt scripts Merge pull request
!3110
from sunjunjie/master
9 个月前
deepseek2
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge:
!3555
merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM
!3555
7 个月前
deepseek25
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge:
!3555
merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM
!3555
7 个月前
deepseek2_coder
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge:
!3555
merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM
!3555
7 个月前
deepseek2_lite
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge:
!3555
merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM
!3555
7 个月前
deepseek3
[pytorch][bugfix]fix script of ds3 lora2hf Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge:
!4119
merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix script of ds3 lora2hf Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix script of ds3 lora2hf See merge request: Ascend/MindSpeed-LLM
!4119
4 个月前
deepseek_math
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
deepseek_r1_distill_llama
!2957
[pytorch][bugfix]fix srcipt of promot-type overlap Merge pull request
!2957
from qu_yueze/master
10 个月前
deepseek_r1_distill_qwen
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
gemma
!3229
[pytorch][bugfix] gemma bugfix Merge pull request
!3229
from Peihan Liu/master
8 个月前
gemma2
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
glm4
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
glm45-moe
[pytorch][bugfix]fix script of glm45-moe Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge:
!4132
merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix script of glm45-moe Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix script of glm45-moe See merge request: Ascend/MindSpeed-LLM
!4132
3 个月前
gpt4
!3174
[pytorch][bugfix]fix gpt4 dropless assertionError Merge pull request
!3174
from sunjunjie/master
9 个月前
hunyuanLarge
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge:
!3555
merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM
!3555
7 个月前
internlm2
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
internlm25
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
internlm3
[pytorch][bugfix]fix bug of script of internlm3 Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge:
!3961
merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix bug of script of internlm3 Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix bug of script of internlm3 See merge request: Ascend/MindSpeed-LLM
!3961
5 个月前
kimi2
!3163
[pytorch][feature]switch megatron_adaptor to v2 Merge pull request
!3163
from yanzhixiao/switch-v2
9 个月前
ling_v2
[pytorch][bugfix] update ling-mini not support mtp Co-authored-by: jzh6229<jiangzhihui4@huawei.com> # message auto-generated for no-merge-commit merge:
!3948
merge 2.2.0 into 2.2.0 [pytorch][bugfix] update ling-mini not support mtp Created-by: jzh6229 Commit-by: jzh6229 Merged-by: ascend-robot Description: [pytorch][bugfix] update ling-mini not support mtp See merge request: Ascend/MindSpeed-LLM
!3948
5 个月前
llama2
!3252
[pytorch][bugfix]Fix LoRA finetune bug Merge pull request
!3252
from sunjunjie/master
8 个月前
llama3
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>
7 个月前
llama31
[pytorch][bugfix]fix llama31-405b sh Co-authored-by: guozhihua<guozhihua2@huawei.com>
7 个月前
llama32
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
llama33
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
mamba2
!3037
[pytorch][bugfix]fix ckpt of mamba2 Merge pull request
!3037
from qu_yueze/master
10 个月前
minicpm
[pytorch][model]update minicp8*2b sh Co-authored-by: guozhihua<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge:
!3496
merge minicpm_82b_update_2.2 into 2.2.0 [pytorch][model]update minicp8*2b sh Created-by: guozhihua2 Commit-by: guozhihua Merged-by: ascend-robot Description: add recompute-activation-function in minicp8*2b sh See merge request: Ascend/MindSpeed-LLM
!3496
7 个月前
minicpm3
!2971
[pytorch][refactor]MLA module upgrade, parameter alignment with Megatron. Merge pull request
!2971
from mhh001/master
10 个月前
mistral
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
mixtral
[pytorch][doc]doc file update Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge:
!3535
merge br_220_update_doc_file_1017 into 2.2.0 [pytorch][doc]doc file update Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: 2.2.0 doc file update See merge request: Ascend/MindSpeed-LLM
!3535
7 个月前
phi35
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
qwen15
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
qwen2
!2971
[pytorch][refactor]MLA module upgrade, parameter alignment with Megatron. Merge pull request
!2971
from mhh001/master
10 个月前
qwen25
[pytorch][bugfix]fix hunyuan tune bug Co-authored-by: guozhihua<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge:
!3519
merge huanyuan_tune_fix_2.2 into 2.2.0 [pytorch][bugfix]fix hunyuan tune bug Created-by: guozhihua2 Commit-by: guozhihua Merged-by: ascend-robot Description: 1.change hunyuan tune pad-to-multiple-of to 8 and add use-flash-attn 2.add recompute-activation-function in qwen25-72b-4k-pack See merge request: Ascend/MindSpeed-LLM
!3519
7 个月前
qwen25_coder
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
qwen25_math
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
qwen2_moe
[pytorch][bugfix]fix lora bug in deepseek and qwen25 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge:
!3584
merge lora_fix_deepseek_and_qwen25_2.2 into 2.2.0 [pytorch][bugfix]fix lora bug in deepseek and qwen25 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1.修复lora单独开启gemm报错问题 2.修复qwen2-57b开启tp报错缺失sp问题 See merge request: Ascend/MindSpeed-LLM
!3584
7 个月前
qwen3
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>
7 个月前
qwen3_moe
[pytorch][model]update qwen3_30b pp4ep4 to pp2ep8 in A2 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge:
!3832
merge update_qwen3_30b_sh_2.2.0 into 2.2.0 [pytorch][model]update qwen3_30b pp4ep4 to pp2ep8 in A2 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1. 更新qwen3_30b的切分pp4ep4到pp2ep8 See merge request: Ascend/MindSpeed-LLM
!3832
5 个月前
qwen3_next
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>
7 个月前
qwq
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
seed_oss
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>
7 个月前
yi
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前
yi15
!2828
[pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request
!2828
from yanzhixiao/add-environment-variable
11 个月前