MindSpeed-LLM/configs · Ascend/MindSpeed-LLM - AtomGit

ascend-robotfeat(pytorch): add dsv4 mg2hf

文件	最后提交记录	最后更新时间
checkpoint	feat(pytorch): add dsv4 mg2hf Co-authored-by: qyzqyz<quyueze@h-partners.com> # message auto-generated for no-merge-commit merge: !4458 merge master into master feat(pytorch): add dsv4 mg2hf Created-by: qyzqyz Commit-by: qyzqyz Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1. add dsv4 mg2hf - only support pp - only support etp = 1 or tp = 1 2. fix dsv4 hf2mg vpp ## Does this PR introduce any user-facing change? if use base model of dsv4 to do mg2hf convert, please set --model-type-hf with deepseek4_base ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!4458	19 天前
evaluate	!2183 新增cmmlu评估 Merge pull request !2183 from shenjiarun/master	1 年前
finetune	feat(pytorch): add DeepSeek4 fine-tuning template Co-authored-by: HanhuiChen<chenhanhui1@h-partners.com> # message auto-generated for no-merge-commit merge: !4436 merge dsv4 into master feat(pytorch): add DeepSeek4 fine-tuning template Created-by: HANHU1CHEN Commit-by: HanhuiChen Merged-by: ascend-robot Description: ## What this PR does / why we need it? Adds a fine-tuning template for the DeepSeek4 model series to support its specific prompt format, including thinking mode, tool calling (DSML format), and reasoning effort control. ## Does this PR introduce any user-facing change? Yes — users can now select --prompt-type deepseek4 to fine-tune DeepSeek4 models. Two new behaviors are also exposed: - `--enable-thinking` controls thinking vs chat mode - `--reasoning-effort {max,high}` inserts a max-effort instruction prefix; only valid when thinking is enabled - `--drop-thinking` controls whether reasoning content is kept in each turn ## How was this patch tested? Tested with byte-level alignment against the official encoding_dsv4 script. See merge request: Ascend/MindSpeed-LLM!4436	25 天前
fsdp2	refactor(pytorch): Model Sunset Plan II Co-authored-by: HanhuiChen<chenhanhui1@h-partners.com> # message auto-generated for no-merge-commit merge: !4337 merge deprecated into master refactor(pytorch): Model Sunset Plan II Created-by: HANHU1CHEN Commit-by: HanhuiChen Merged-by: ascend-robot Description: ## What this PR does / why we need it? This PR is for removing deprecated model scripts, please refer associated issue for details. ## Does this PR introduce any user-facing change? This will not affect users; they can continue to use the deprecated model for training in the previous supported version. ## How was this patch tested? Following internal discussions and the public announcement of the issue, we have decided to remove these models. See merge request: Ascend/MindSpeed-LLM!4337	2 个月前
rlhf	add rl ci Co-authored-by: fh_188<fenghui32@huawei.com> # message auto-generated for no-merge-commit merge: !3951 merge master_rl_ci into master [pytorch][feature]Adapt RL for vllm0110 and add CI Created-by: weixin_44917616 Commit-by: fh_188 Merged-by: ascend-robot Description: [pytorch][feature]Adapt RL for vllm0110 and add CI See merge request: Ascend/MindSpeed-LLM!3951	5 个月前