File    Last commit    Last updated
[pytorch][bugfix] baichaun2 no-fa-adapt Co-authored-by: jzh6229<jiangzhihui4@huawei.com> 7 months ago
!3110 [pytorch][sh]add CUDA_DEVICE_MAX_CONNECTION in convert_ckpt scripts Merge pull request !3110 from sunjunjie/master 9 months ago
!3110 [pytorch][sh]add CUDA_DEVICE_MAX_CONNECTION in convert_ckpt scripts Merge pull request !3110 from sunjunjie/master 9 months ago
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555 7 months ago
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555 7 months ago
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555 7 months ago
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555 7 months ago
[pytorch][bugfix]fix script of ds3 lora2hf Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !4119 merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix script of ds3 lora2hf Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix script of ds3 lora2hf See merge request: Ascend/MindSpeed-LLM!4119 4 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2957 [pytorch][bugfix]fix srcipt of promot-type overlap Merge pull request !2957 from qu_yueze/master 10 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!3229 [pytorch][bugfix] gemma bugfix Merge pull request !3229 from Peihan Liu/master 8 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
[pytorch][bugfix]fix script of glm45-moe Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !4132 merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix script of glm45-moe Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix script of glm45-moe See merge request: Ascend/MindSpeed-LLM!4132 3 months ago
!3174 [pytorch][bugfix]fix gpt4 dropless assertionError Merge pull request !3174 from sunjunjie/master 9 months ago
[pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !3555 merge add-permute-fusion-2.2.0 into 2.2.0 [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: [pytorch][sh] restore the --moe-permute-fusion parameter that was removed in Megatron r0.12.1 See merge request: Ascend/MindSpeed-LLM!3555 7 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
[pytorch][bugfix]fix bug of script of internlm3 Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !3961 merge 2.2.0 into 2.2.0 [pytorch][bugfix]fix bug of script of internlm3 Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: fix bug of script of internlm3 See merge request: Ascend/MindSpeed-LLM!3961 5 months ago
!3163 [pytorch][feature]switch megatron_adaptor to v2 Merge pull request !3163 from yanzhixiao/switch-v2 9 months ago
[pytorch][bugfix] update ling-mini not support mtp Co-authored-by: jzh6229<jiangzhihui4@huawei.com> # message auto-generated for no-merge-commit merge: !3948 merge 2.2.0 into 2.2.0 [pytorch][bugfix] update ling-mini not support mtp Created-by: jzh6229 Commit-by: jzh6229 Merged-by: ascend-robot Description: [pytorch][bugfix] update ling-mini not support mtp See merge request: Ascend/MindSpeed-LLM!3948 5 months ago
!3252 [pytorch][bugfix]Fix LoRA finetune bug Merge pull request !3252 from sunjunjie/master 8 months ago
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> 7 months ago
[pytorch][bugfix]fix llama31-405b sh Co-authored-by: guozhihua<guozhihua2@huawei.com> 7 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!3037 [pytorch][bugfix]fix ckpt of mamba2 Merge pull request !3037 from qu_yueze/master 10 months ago
[pytorch][model]update minicp8*2b sh Co-authored-by: guozhihua<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3496 merge minicpm_82b_update_2.2 into 2.2.0 [pytorch][model]update minicp8*2b sh Created-by: guozhihua2 Commit-by: guozhihua Merged-by: ascend-robot Description: add recompute-activation-function in minicp8*2b sh See merge request: Ascend/MindSpeed-LLM!3496 7 months ago
!2971 [pytorch][refactor]MLA module upgrade, parameter alignment with Megatron. Merge pull request !2971 from mhh001/master 10 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
[pytorch][doc]doc file update Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !3535 merge br_220_update_doc_file_1017 into 2.2.0 [pytorch][doc]doc file update Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: 2.2.0 doc file update See merge request: Ascend/MindSpeed-LLM!3535 7 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2971 [pytorch][refactor]MLA module upgrade, parameter alignment with Megatron. Merge pull request !2971 from mhh001/master 10 months ago
[pytorch][bugfix]fix hunyuan tune bug Co-authored-by: guozhihua<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3519 merge huanyuan_tune_fix_2.2 into 2.2.0 [pytorch][bugfix]fix hunyuan tune bug Created-by: guozhihua2 Commit-by: guozhihua Merged-by: ascend-robot Description: 1. change hunyuan tune pad-to-multiple-of to 8 and add use-flash-attn 2. add recompute-activation-function in qwen25-72b-4k-pack See merge request: Ascend/MindSpeed-LLM!3519 7 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
[pytorch][bugfix]fix lora bug in deepseek and qwen25 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3584 merge lora_fix_deepseek_and_qwen25_2.2 into 2.2.0 [pytorch][bugfix]fix lora bug in deepseek and qwen25 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1. Fix the error raised when LoRA is enabled with gemm alone 2. Fix the missing-sp error raised when qwen2-57b is run with tp enabled See merge request: Ascend/MindSpeed-LLM!3584 7 months ago
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> 7 months ago
[pytorch][model]update qwen3_30b pp4ep4 to pp2ep8 in A2 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !3832 merge update_qwen3_30b_sh_2.2.0 into 2.2.0 [pytorch][model]update qwen3_30b pp4ep4 to pp2ep8 in A2 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1. Update the qwen3_30b partitioning from pp4ep4 to pp2ep8 See merge request: Ascend/MindSpeed-LLM!3832 5 months ago
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> 7 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
[pytorch][bugfix]in variable_seq_lengths mode, set --log-throughput to false Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> 7 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago
!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS Merge pull request !2828 from yanzhixiao/add-environment-variable 11 months ago