文件最后提交记录最后更新时间
move mindio_ttp adaptor to MindSpeed Co-authored-by: wangguoyan<wangguoyan6@h-partners.com> # message auto-generated for no-merge-commit merge: !3612 merge master_ha into master [pytorch][feature][mindio] Integration of high availability-related adaptation code Created-by: guoywang Commit-by: wangguoyan Merged-by: ascend-robot Description: move mindio_ttp adaptor to MindSpeed-LLM See merge request: Ascend/MindSpeed-LLM!36126 个月前
[pytorch][bugfix][mindcluster] fix elastic-training scale-in failedwhen pp is 2 Co-authored-by: 李鸣沼<lmztju@126.com> # message auto-generated for no-merge-commit merge: !3853 merge master into master [pytorch][bugfix][mindcluster] fix elastic-training scale-in failedwhen pp is 2 Created-by: lmztju Commit-by: 李鸣沼 Merged-by: ascend-robot Description: [pytorch][bugfix][mindcluster] fix elastic-training scale-in failedwhen pp is 2 See merge request: Ascend/MindSpeed-LLM!38535 个月前
[pytorch][mindio][feature]Ensure that the ACP Level 1 asynchronous save feature is compatible with TFT online recovery. Co-authored-by: z30027952<zengyihang2@h-partners.com> # message auto-generated for no-merge-commit merge: !4103 merge acp_tft_compatibility into master [pytorch][mindio][feature]Ensure that the ACP Level 1 asynchronous save feature is compatible with TFT online recovery. Created-by: zengyihang Commit-by: z30027952 Merged-by: ascend-robot Description: [pytorch][mindio][feature]高可用支持ACP&TFT能力兼容,使训练过程中ACP一级异步保存能力和TFT在线恢复能力同时生效 See merge request: Ascend/MindSpeed-LLM!41033 个月前
[pytorch][feature][mindio] remove deprecated code Co-authored-by: pengchenhui<pengchenhui2@huawei.com> # message auto-generated for no-merge-commit merge: !3703 merge master_gitcode into master [pytorch][feature][mindio] remove deprecated code Created-by: qq_35564531 Commit-by: pengchenhui Merged-by: ascend-robot Description: remove deprecated Code. See merge request: Ascend/MindSpeed-LLM!37036 个月前