| docs: change pr template Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !4299 merge master_pr_template3 into master docs: change pr template Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: ## What this PR does / why we need it? change pr template ## Does this PR introduce any user-facing change? NA ## How was this patch tested? NA See merge request: Ascend/MindSpeed-LLM!4299 | 3 months ago |
| ci(torch): run st with pytest Co-authored-by: iansheng<shengjiayi@huawei.com> # message auto-generated for no-merge-commit merge: !4403 merge ci412 into master ci(torch): run st with pytest Created-by: iansheng Commit-by: iansheng Merged-by: ascend-robot Description: ## What this PR does / why we need it? Run the ST test suite with pytest instead of shell scripts. ## Does this PR introduce any user-facing change? no ## How was this patch tested? Yes See merge request: Ascend/MindSpeed-LLM!4403 | 2 months ago |
| fix: fix mg2hf bug, support qkv_split per_group layout for seed-oss mcore2hf conversion Co-authored-by: wanggangguo<wanggangguo@huawei.com> # message auto-generated for no-merge-commit merge: !4656 merge upgrade into master fix: fix mg2hf bug, support qkv_split per_group layout for seed-oss mcore2hf conversion Created-by: isfrapples Commit-by: wanggangguo Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1. Adds seed-oss and qwen25 to the v2 converter's supported model list with proper model_cfg.json configuration. 2. Fixes fused QKV weight splitting by introducing a qkv_split attribute so the converter correctly handles both per-group-interleaved (seed-oss) and contiguous (all other models) memory layouts. 3. Fixes qwen25 conversion producing checkpoints without linear_qkv.bias by registering it as a standalone model type instead of falling back to llama2. ## Does this PR introduce any user-facing change? 1. **seed-oss**: mcore2hf conversion now produces correct Q/K/V weights. 2. **qwen25**: existing checkpoints converted with --model-type-hf llama2 must be re-converted with --model-type-hf qwen25 to include linear_qkv.bias. 3. **All other models**: no behavioral change. ## How was this patch tested? 1. All 5 mcore2hf cases covering qkv_type='unpack' (seed-oss MHA TP2, qwen25 MHA TP4, qwen2-moe GQA, llama2 MQA TP2, llama2 MHA TP1) passed. 2. HF→MG round-trip cases for qwen25 and qwen2-moe passed. 3. Four qwen25 layerwise disaggregated training CI scripts were updated to --model-type-hf qwen25, fixing the previously reported Missing linear_qkv.bias errors. Fixes #1551 See merge request: Ascend/MindSpeed-LLM!4656 | 48 minutes ago |
| docs: use absolute license link in docker overview Co-authored-by: wxy1105<wangxinyu136@huawei.com> # message auto-generated for no-merge-commit merge: !4654 merge master_doc into master docs: use absolute license link in docker overview Created-by: wxy1105 Commit-by: wxy1105 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Change the relative license path in the Docker overview to an absolute path. ## Does this PR introduce any user-facing change? Modifies the Docker overview. ## How was this patch tested? Clicked the license link to verify that it opens the corresponding page. See merge request: Ascend/MindSpeed-LLM!4654 | 1 day ago |
| feat(pytorch): add glm52 in fsdp2 Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !4614 merge add_glm52_in_fsdp2 into master feat(pytorch): add glm52 in fsdp2 Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: ## What this PR does / why we need it? This PR adds GLM52 model support in the FSDP2 training framework. Main changes include: 1. Add the GLM52 model implementation for FSDP2, including model definition, configuration adaptation, and registration logic. 2. Support GLM52 pretraining under the FSDP2 framework, enabling users to launch GLM52 training with the FSDP2 training entry and related scripts. 3. Support GLM52 chat/inference flow, so the adapted GLM52 model can be used for basic generation and chat validation after loading. 4. Adapt GLM52-specific model logic in FSDP2, including attention/indexer-related behavior and model forward compatibility required by GLM52. 5. Provide related ST coverage to ensure the GLM52 FSDP2 model path can be built, loaded, and executed correctly. ## Does this PR introduce any user-facing change? Yes. Users can now use the GLM52 model in the FSDP2 framework, including GLM52 pretraining and chat/inference workflows. Existing model usage is not expected to be affected. ## How was this patch tested? Verified by running the GLM52 ST test. pipeline/st/glm52/pretrain_glm52_38b_4k_fsdp2_A3.sh The test covers the GLM52 FSDP2 model build and execution path, including pretraining-related runtime validation and basic chat/inference functionality. See merge request: Ascend/MindSpeed-LLM!4614 | 14 hours ago |
| feat: compatibility improvements among third-party dependencies Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !4638 merge requirement-fix into master feat: compatibility improvements among third-party dependencies Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1. Improves compatibility with third-party dependencies. 2. Removes version constraints on third-party packages, including numpy, datasets, ray, bitsandbytes-npu-beta, mistral_common, and antlr4-python3-runtime. 3. Adds support for newer versions of PEFT. 4. The transformers version has already been approved and is not included in this update. ## Does this PR introduce any user-facing change? No. This PR only contains compatibility improvements and does not introduce any user-facing changes. ## How was this patch tested? 1. Passed smoke tests in the Blue Zone. 2. Passed ST validation in the LLM repository. See merge request: Ascend/MindSpeed-LLM!4638 | 3 days ago |
| [feat]Modify and improve the configuration of the pre-commit open-source code detection tool Co-authored-by: wujinyuan1<wujinyuan1@huawei.com> # message auto-generated for no-merge-commit merge: !4448 merge master into master [feat]Modify and improve the configuration of the pre-commit open-source code detection tool Created-by: wujinyuan1 Commit-by: wujinyuan1 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Modifies the pre-commit rule configuration: during the trial run, the Python bandit checks proved too strict and went beyond the Ascend coding standard. This change focuses on adjusting the bandit rules. ## Does this PR introduce any user-facing change? The gatekeeper codecheck rules are streamlined, improving code merge efficiency. ## How was this patch tested? This is a code-scanning tool configuration change and does not affect repository functionality. See merge request: Ascend/MindSpeed-LLM!4448 | 1 month ago |
| docs(fsdp2): fix documentation format issues and improve readability Co-authored-by: wangjiangben<wangjiangben@huawei.com> # message auto-generated for no-merge-commit merge: !4476 merge docs/fix-fsdp2-docs-format into master docs(fsdp2): fix documentation format issues and improve readability Created-by: wangjiangben Commit-by: wangjiangben Merged-by: ascend-robot Description: ## Summary Fix format errors in FSDP2 documentation and optimize document structure to improve readability and compliance with Markdown standards. **Format Fixes:** - Fix table format issues in quantization.md (missing header separators, column alignment errors) - Fix HTML entity syntax error in arguments.md ("ulysses" missing semicolon) - Remove extra blank lines in code blocks - Standardize indentation in example scripts **Structure Optimization:** - Unify list markers to standard Markdown - syntax - Optimize DTensor section hierarchy for better structure clarity - Convert reference links to proper Markdown link format - Split long paragraphs in quantization descriptions for better readability - Fix MD032 lint error (add blank line before list) **Files Changed:** - arguments.md: Fix HTML entity syntax - fsdp2_basic_features.md: Optimize structure hierarchy and list format - quantization.md: Fix table format, optimize description text See merge request: Ascend/MindSpeed-LLM!4476 | 1 month ago |
| docs: update README and CONTRIBUTING.md Co-authored-by: xuwenyue<xuwenyue1@huawei.com> # message auto-generated for no-merge-commit merge: !4440 merge master into master docs: update README and CONTRIBUTING.md Created-by: xuwenyue Commit-by: xuwenyue Merged-by: ascend-robot Description: ## What this PR does / why we need it? update README, CONTRIBUTING.md, FAQ.md, features/README.md, performance_tuning.md ## Does this PR introduce any user-facing change? NA ## How was this patch tested? NA See merge request: Ascend/MindSpeed-LLM!4440 | 1 month ago |
| update LICENSE. add license Signed-off-by: fengliangjun <fengliangjun@huawei.com> | 1 year ago |
| [pytorch][feature] chunkloss for fsdp2 Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> Co-authored-by: wenjiang23<wenjiang@huawei.com> # message auto-generated for no-merge-commit merge: !3784 merge master into master [pytorch][feature] chunkloss for fsdp2 Created-by: wenjiang23 Commit-by: wenjiang23;sunjunjie1587 Merged-by: ascend-robot Description: Adds the chunkloss feature to FSDP2 to address peak memory usage in the lm_head/cross-entropy loss computation. See merge request: Ascend/MindSpeed-LLM!3784 | 6 months ago |
| docs: fix quick_start, README, install_guide docs and improve user experience Co-authored-by: xuwenyue<xuwenyue1@huawei.com> # message auto-generated for no-merge-commit merge: !4639 merge fix_doc_0624 into master docs: fix quick_start, README, install_guide docs and improve user experience Created-by: xuwenyue Commit-by: xuwenyue Merged-by: ascend-robot Description: ## What this PR does / why we need it? This PR mainly focuses on improving documentation completeness and user experience for MindSpeed LLM: 1. Added FSDP2 quick start documentation link to README.md and docs_guide.md to provide better navigation for users who want to use the FSDP2 training backend. 2. Added an English placeholder note in docs/en/README.md indicating that documentation translation is in progress, guiding users to refer to Chinese docs for the latest content. 3. Added a table legend explanation in README.md to clarify the meaning of "Released", "✅" and "❌" fields, improving documentation clarity. 4. Added ModelScope download option for model weights in both quick_start.md and fsdp2_quick_start.md, which is more accessible for developers in China who may have difficulty accessing HuggingFace. ## Does this PR introduce any user-facing change? - Documentation updates: Multiple documentation files have been updated with new content and links. - Path changes: - README.md - Added FSDP2 quick start link and table legend - docs/zh/docs_guide.md - Updated FSDP2 documentation link - docs/en/README.md - Added translation in-progress note - docs/zh/pytorch/training/quick_start.md - Added ModelScope weight download method - docs/zh/pytorch/training/fsdp2_quick_start.md - Added ModelScope weight download method ## How was this patch tested? 1. Verified all links are correctly formatted and point to existing documentation files 2. Checked markdown syntax correctness for all changes 3. Confirmed consistency between quick_start.md and fsdp2_quick_start.md for the ModelScope download section 4. Verified that all documentation updates maintain proper structure and readability No code changes were introduced in this PR, so no functional testing is required. The changes only affect documentation content and navigation. See merge request: Ascend/MindSpeed-LLM!4639 | 5 days ago |
| !2500 update notice Merge pull request !2500 from sunjunjie/master | 1 year ago |
| transformer 4 upgrade 5 & adapt tests Co-authored-by: wanggangguo<wanggangguo@huawei.com> # message auto-generated for no-merge-commit merge: !4545 merge upgrade into master transformer 4 upgrade 5 & adapt tests Created-by: isfrapples Commit-by: wanggangguo Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4545 | 15 days ago |
| refactor(megatron):update coverage script Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4295 merge br_master_coverage_fix_0313 into master refactor(megatron):update coverage script Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: # Coverage analysis script update ---- ## What this PR does / why we need it? Updates the repository's run_coverage.sh script to fix incomplete file scanning during coverage analysis. ## Does this PR introduce any user-facing change? It does not affect the repository's basic functionality; it is intended to improve the coverage analysis. ## How was this patch tested? Self-tested successfully on a Blue Zone machine. See merge request: Ascend/MindSpeed-LLM!4295 | 3 months ago |
| feature(pytorch): FSDP2 support hardware-adaptive execution Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4343 merge fsdp2_gpu into master feature(pytorch): FSDP2 support hardware-adaptive execution Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? FSDP2 supports hardware-adaptive execution. ## Does this PR introduce any user-facing change? No ## How was this patch tested? Pipeline test passed. See merge request: Ascend/MindSpeed-LLM!4343 | 3 months ago |
| feat(pytorch): add DeepSeek4 fine-tuning template Co-authored-by: HanhuiChen<chenhanhui1@h-partners.com> # message auto-generated for no-merge-commit merge: !4436 merge dsv4 into master feat(pytorch): add DeepSeek4 fine-tuning template Created-by: HANHU1CHEN Commit-by: HanhuiChen Merged-by: ascend-robot Description: ## What this PR does / why we need it? Adds a fine-tuning template for the DeepSeek4 model series to support its specific prompt format, including thinking mode, tool calling (DSML format), and reasoning effort control. ## Does this PR introduce any user-facing change? Yes. Users can now select --prompt-type deepseek4 to fine-tune DeepSeek4 models. The following options are also exposed: - --enable-thinking controls thinking vs chat mode - --reasoning-effort {max,high} inserts a max-effort instruction prefix; only valid when thinking is enabled - --drop-thinking controls whether reasoning content is kept in each turn ## How was this patch tested? Tested with byte-level alignment against the official encoding_dsv4 script. See merge request: Ascend/MindSpeed-LLM!4436 | 1 month ago |
| [pytorch][feature]Add RL data processing Co-authored-by: fh_188<fenghui32@huawei.com> # message auto-generated for no-merge-commit merge: !3646 merge fenzhi into master [pytorch][feature]Add RL data processing Created-by: weixin_44917616 Commit-by: fh_188 Merged-by: ascend-robot Description: add preprocess prompt; modify the template path See merge request: Ascend/MindSpeed-LLM!3646 | 7 months ago |
| feat(pytorch): support o lora rank and q lora rank in v4pro Co-authored-by: dingzicha1997<dingzilin@huawei.com> # message auto-generated for no-merge-commit merge: !4427 merge master into master feat(pytorch): support o lora rank and q lora rank in v4pro Created-by: dingzicha1997 Commit-by: dingzicha1997 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4427 | 2 months ago |
| add rl ci Co-authored-by: fh_188<fenghui32@huawei.com> # message auto-generated for no-merge-commit merge: !3951 merge master_rl_ci into master [pytorch][feature]Adapt RL for vllm0110 and add CI Created-by: weixin_44917616 Commit-by: fh_188 Merged-by: ascend-robot Description: [pytorch][feature]Adapt RL for vllm0110 and add CI See merge request: Ascend/MindSpeed-LLM!3951 | 6 months ago |
| [pytorch][build]update setup.py version and python support range master Co-authored-by: tichang<tichang@huawei.com> # message auto-generated for no-merge-commit merge: !4076 merge chore/update_setup.py_version_and_pyversion_info into master [pytorch][build]update setup.py version and python support range master Created-by: tichang Commit-by: tichang Merged-by: ascend-robot Description: chore(setup): update setup.py version and python support range. Updated the version information and the supported Python versions in setup.py on the master branch. See merge request: Ascend/MindSpeed-LLM!4076 | 5 months ago |
| fix:Fix FSDP2 backend scheduler max step calculation. Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> # message auto-generated for no-merge-commit merge: !4505 merge master into master fix:Fix FSDP2 backend scheduler max step calculation. Created-by: sunjunjie1587 Commit-by: sunjunjie1587 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4505 | 1 month ago |
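
The mg2hf fix listed above (!4656) distinguishes a per-group-interleaved fused QKV layout from a contiguous one. The sketch below only illustrates that distinction with NumPy and is not the converter's code: the function names, the `layout` strings, and the assumption that each group stores its query heads followed by one key head and one value head are all hypothetical choices made for this example.

```python
import numpy as np


def interleave_per_group(q, k, v, num_kv_heads, head_dim):
    """Build a fused weight laid out as [Q_g, K_g, V_g] for each KV group g.

    Hypothetical helper used only to create test data for this sketch.
    """
    hidden = q.shape[1]
    q_g = q.reshape(num_kv_heads, -1, hidden)        # query rows of each group
    k_g = k.reshape(num_kv_heads, head_dim, hidden)  # one key head per group
    v_g = v.reshape(num_kv_heads, head_dim, hidden)  # one value head per group
    return np.concatenate([q_g, k_g, v_g], axis=1).reshape(-1, hidden)


def split_fused_qkv(weight, num_heads, num_kv_heads, head_dim, layout):
    """Split a fused (rows, hidden) QKV weight into separate Q, K, V matrices.

    layout="contiguous": rows are [all Q | all K | all V].
    layout="per_group":  rows are [Q_g | K_g | V_g] repeated per KV group.
    Both layout names are assumptions of this sketch.
    """
    q_per_group = num_heads // num_kv_heads
    q_rows = num_heads * head_dim
    kv_rows = num_kv_heads * head_dim
    if weight.shape[0] != q_rows + 2 * kv_rows:
        raise ValueError("fused weight has an unexpected number of rows")
    hidden = weight.shape[1]

    if layout == "contiguous":
        q, k, v = np.split(weight, [q_rows, q_rows + kv_rows], axis=0)
        return q, k, v

    if layout == "per_group":
        # View the rows as one block per KV group, then slice inside each block.
        grouped = weight.reshape(num_kv_heads, (q_per_group + 2) * head_dim, hidden)
        q_end = q_per_group * head_dim
        k_end = q_end + head_dim
        q = grouped[:, :q_end].reshape(q_rows, hidden)
        k = grouped[:, q_end:k_end].reshape(kv_rows, hidden)
        v = grouped[:, k_end:].reshape(kv_rows, hidden)
        return q, k, v

    raise ValueError(f"unknown layout: {layout!r}")
```

Splitting a per-group weight with the contiguous rule still returns arrays of the expected shapes, so a layout mismatch produces wrong Q/K/V values rather than an error. A round-trip check against known Q, K, V matrices is one way to catch it.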