| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
chore(pytorch):optimize GLM5 pretraining script Co-authored-by: HanhuiChen<chenhanhui1@h-partners.com> # message auto-generated for no-merge-commit merge: !4340 merge glm into master chore(pytorch):optimize GLM5 pretraining script Created-by: HANHU1CHEN Commit-by: HanhuiChen Merged-by: ascend-robot Description: ## What this PR does / why we need it? This PR updates the GLM5 pretraining script to improve training performance and efficiency. ## Does this PR introduce any user-facing change? No major user-facing changes, but minor adjustments to configs or launch commands may be required. ## How was this patch tested? Validated through end-to-end pretraining runs, confirming improved performance and stable training behavior. See merge request: Ascend/MindSpeed-LLM!4340 | 3 个月前 | |
[pytorch][model]add megatron weight-converter for glm5 Co-authored-by: l00961550<liuqingyuan19@huawei.com> # message auto-generated for no-merge-commit merge: !4162 merge master into master [pytorch][model]add megatron weight-converter for glm5 Created-by: lqy16 Commit-by: l00961550 Merged-by: ascend-robot Description: add megatron weight-converter for glm5 See merge request: Ascend/MindSpeed-LLM!4162 | 4 个月前 | |
[pytorch][model]add pretrain_glm5_744b_4k_A3_ptd.sh Co-authored-by: EVA1<jingsiyu1@huawei.com> # message auto-generated for no-merge-commit merge: !4189 merge master into master [pytorch][model]add pretrain_glm5_744b_4k_A3_ptd.sh Created-by: EVA1 Commit-by: EVA1 Merged-by: ascend-robot Description: add pretrain_glm5_744b_4k_A3_ptd.sh See merge request: Ascend/MindSpeed-LLM!4189 | 4 个月前 | |
fix(pytorch):add ckpt-format argument to scripts Co-authored-by: z__y<z4t155664@163.com> # message auto-generated for no-merge-commit merge: !4371 merge add_ckpt_torch_dist_argument_for_shells into master fix(pytorch):add ckpt-format argument to scripts Created-by: z__y Commit-by: z__y Merged-by: ascend-robot Description: ## What this PR does / why we need it? This PR explicitly adds ckpt-format torch to all repository scripts to support the asynchronous checkpoint saving feature. ## Does this PR introduce any user-facing change? No. This change only adjusts internal script parameters to maintain existing behavior. There are no user-facing API or usage changes. ## How was this patch tested? Tests confirm that asynchronous checkpoint saving works correctly and that the original torch format checkpoint behavior is preserved. See merge request: Ascend/MindSpeed-LLM!4371 | 2 个月前 | |
feat(pytorch): add GLM5 poc script Co-authored-by: HanhuiChen<chenhanhui1@h-partners.com> # message auto-generated for no-merge-commit merge: !4457 merge glm into master feat(pytorch): add GLM5 poc script Created-by: HANHU1CHEN Commit-by: HanhuiChen Merged-by: ascend-robot Description: ## What this PR does / why we need it? Add GLM-5 script for POC. ## Does this PR introduce any user-facing change? No. ## How was this patch tested? We have tested this scipt with full-parameter model in 32 nodes。 See merge request: Ascend/MindSpeed-LLM!4457 | 1 个月前 |
| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
| 3 个月前 | ||
| 4 个月前 | ||
| 4 个月前 | ||
| 2 个月前 | ||
| 1 个月前 |