MindSpeed-LLM/examples/mcore · Ascend/MindSpeed-LLM - AtomGit

ascend-robot[pytorch][bugfix]fix script of glm45-moe

文件	最后提交记录	最后更新时间
baichuan2	[pytorch][bugfix] baichaun2 no-fa-adapt Co-authored-by: jzh6229<jiangzhihui4@huawei.com>	7 个月前
chatglm3	!3110 [pytorch][sh]add CUDA_DEVICE_MAX_CONNECTION in convert_ckpt scripts Merge pull request !3110 from sunjunjie/master	9 个月前
codellama	!3110 [pytorch][sh]add CUDA_DEVICE_MAX_CONNECTION in convert_ckpt scripts Merge pull request !3110 from sunjunjie/master	9 个月前
deepseek2	[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555	7 个月前
deepseek25	[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555	7 个月前
deepseek2_coder	[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555	7 个月前
deepseek2_lite	[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555	7 个月前
deepseek3	[pytorch][bugfix]fix script of ds3 lora2hf Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !4119 merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix script of ds3 lora2hf Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix script of ds3 lora2hf See merge request: Ascend/MindSpeed-LLM!4119	4 个月前
deepseek_math	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
deepseek_r1_distill_llama	!2957 [pytorch][bugfix]fix srcipt of promot-type overlap Merge pull request !2957 from qu_yueze/master	10 个月前
deepseek_r1_distill_qwen	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
gemma	!3229 [pytorch][bugfix] gemma bugfix Merge pull request !3229 from Peihan Liu/master	8 个月前
gemma2	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
glm4	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
glm45-moe	[pytorch][bugfix]fix script of glm45-moe Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !4132 merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix script of glm45-moe Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix script of glm45-moe See merge request: Ascend/MindSpeed-LLM!4132	3 个月前
gpt4	!3174 [pytorch][bugfix]fix gpt4 dropless assertionError Merge pull request !3174 from sunjunjie/master	9 个月前
hunyuanLarge	[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555	7 个月前
internlm2	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
internlm25	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
internlm3	[pytorch][bugfix]fix bug of script of internlm3 Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !3961 merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix bug of script of internlm3 Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix bug of script of internlm3 See merge request: Ascend/MindSpeed-LLM!3961	5 个月前
kimi2	!3163 [pytorch][feature]switch megatron_adaptor to v2 Merge pull request !3163 from yanzhixiao/switch-v2	9 个月前
ling_v2	[pytorch][bugfix] update ling-mini not support mtp Co-authored-by: jzh6229<jiangzhihui4@huawei.com> # message auto-generated for no-merge-commit merge: !3948 merge 2.2.0 into 2.2.0 [pytorch][bugfix] update ling-mini not support mtp Created-by: jzh6229 Commit-by: jzh6229 Merged-by: ascend-robot Description: [pytorch][bugfix] update ling-mini not support mtp See merge request: Ascend/MindSpeed-LLM!3948	5 个月前
llama2	!3252 [pytorch][bugfix]Fix LoRA finetune bug Merge pull request !3252 from sunjunjie/master	8 个月前
llama3	[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>	7 个月前
llama31	[pytorch][bugfix]fix llama31-405b sh Co-authored-by: guozhihua<guozhihua2@huawei.com>	7 个月前
llama32	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
llama33	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
mamba2	!3037 [pytorch][bugfix]fix ckpt of mamba2 Merge pull request !3037 from qu_yueze/master	10 个月前
minicpm	[pytorch][model]update minicp82b sh Co-authored-by: guozhihua<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3496 merge minicpm_82b_update_2.2 into 2.2.0 [pytorch][model]update minicp82b sh Created-by: guozhihua2 Commit-by: guozhihua Merged-by: ascend-robot Description: add recompute-activation-function in minicp8*2b sh See merge request: Ascend/MindSpeed-LLM!3496	7 个月前
minicpm3	!2971 [pytorch][refactor]MLA module upgrade, parameter alignment with Megatron. Merge pull request !2971 from mhh001/master	10 个月前
mistral	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
mixtral	[pytorch][doc]doc file update Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !3535 merge br_220_update_doc_file_1017 into 2.2.0 [pytorch][doc]doc file update Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: 2.2.0 doc file update See merge request: Ascend/MindSpeed-LLM!3535	7 个月前
phi35	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
qwen15	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
qwen2	!2971 [pytorch][refactor]MLA module upgrade, parameter alignment with Megatron. Merge pull request !2971 from mhh001/master	10 个月前
qwen25	[pytorch][bugfix]fix hunyuan tune bug Co-authored-by: guozhihua<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3519 merge huanyuan_tune_fix_2.2 into 2.2.0 [pytorch][bugfix]fix hunyuan tune bug Created-by: guozhihua2 Commit-by: guozhihua Merged-by: ascend-robot Description: 1.change hunyuan tune pad-to-multiple-of to 8 and add use-flash-attn 2.add recompute-activation-function in qwen25-72b-4k-pack See merge request: Ascend/MindSpeed-LLM!3519	7 个月前
qwen25_coder	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
qwen25_math	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
qwen2_moe	[pytorch][bugfix]fix lora bug in deepseek and qwen25 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3584 merge lora_fix_deepseek_and_qwen25_2.2 into 2.2.0 [pytorch][bugfix]fix lora bug in deepseek and qwen25 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1.修复lora单独开启gemm报错问题 2.修复qwen2-57b开启tp报错缺失sp问题 See merge request: Ascend/MindSpeed-LLM!3584	7 个月前
qwen3	[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>	7 个月前
qwen3_moe	[pytorch][model]update qwen3_30b pp4ep4 to pp2ep8 in A2 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3832 merge update_qwen3_30b_sh_2.2.0 into 2.2.0 [pytorch][model]update qwen3_30b pp4ep4 to pp2ep8 in A2 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1. 更新qwen3_30b的切分pp4ep4到pp2ep8 See merge request: Ascend/MindSpeed-LLM!3832	5 个月前
qwen3_next	[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>	7 个月前
qwq	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
seed_oss	[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com>	7 个月前
yi	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前
yi15	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable	11 个月前