MindSpeed-LLM/examples/mcore/glm5 · Ascend/MindSpeed-LLM - AtomGit

ascend-robot: feat(pytorch): add GLM5 poc script

File	Last commit	Last updated
ckpt_convert_glm5_hf2mcore.sh	chore(pytorch): optimize GLM5 pretraining script. Co-authored-by: HanhuiChen <chenhanhui1@h-partners.com>. Merge request Ascend/MindSpeed-LLM!4340 (merge glm into master); created by HANHU1CHEN, committed by HanhuiChen, merged by ascend-robot. What this PR does: updates the GLM5 pretraining script to improve training performance and efficiency. User-facing change: no major user-facing changes, but minor adjustments to configs or launch commands may be required. Testing: validated through end-to-end pretraining runs, confirming improved performance and stable training behavior.	3 months ago
ckpt_convert_glm5_mcore2hf.sh	[pytorch][model] add megatron weight-converter for glm5. Co-authored-by: l00961550 <liuqingyuan19@huawei.com>. Merge request Ascend/MindSpeed-LLM!4162 (merge master into master); created by lqy16, committed by l00961550, merged by ascend-robot.	4 months ago
data_convert_glm5_pretrain.sh	[pytorch][model] add pretrain_glm5_744b_4k_A3_ptd.sh. Co-authored-by: EVA1 <jingsiyu1@huawei.com>. Merge request Ascend/MindSpeed-LLM!4189 (merge master into master); created by EVA1, committed by EVA1, merged by ascend-robot.	4 months ago
generate_glm5_744b_A3_ptd.sh	fix(pytorch): add ckpt-format argument to scripts. Co-authored-by: z__y <z4t155664@163.com>. Merge request Ascend/MindSpeed-LLM!4371 (merge add_ckpt_torch_dist_argument_for_shells into master); created by z__y, committed by z__y, merged by ascend-robot. What this PR does: explicitly adds ckpt-format torch to all repository scripts to support the asynchronous checkpoint saving feature. User-facing change: none; the change only adjusts internal script parameters to maintain existing behavior, with no user-facing API or usage changes. Testing: tests confirm that asynchronous checkpoint saving works correctly and that the original torch format checkpoint behavior is preserved.	2 months ago
pretrain_glm5_744b_4k_A3_ptd.sh	feat(pytorch): add GLM5 poc script. Co-authored-by: HanhuiChen <chenhanhui1@h-partners.com>. Merge request Ascend/MindSpeed-LLM!4457 (merge glm into master); created by HANHU1CHEN, committed by HanhuiChen, merged by ascend-robot. What this PR does: adds a GLM-5 script for POC. User-facing change: none. Testing: the script was tested with the full-parameter model on 32 nodes.	1 month ago