| feat(pytorch): add DeepSeek4 fine-tuning template
Co-authored-by: HanhuiChen<chenhanhui1@h-partners.com>
# message auto-generated for no-merge-commit merge:
!4436 merge dsv4 into master
feat(pytorch): add DeepSeek4 fine-tuning template
Created-by: HANHU1CHEN
Commit-by: HanhuiChen
Merged-by: ascend-robot
Description: ## What this PR does / why we need it?
Adds a fine-tuning template for the DeepSeek4 model series to support its specific prompt format, including thinking mode, tool calling (DSML format), and reasoning effort control.
## Does this PR introduce any user-facing change?
Yes — users can now select --prompt-type deepseek4 to fine-tune DeepSeek4 models. Two new behaviors are also exposed:
- --enable-thinking controls thinking vs chat mode
- --reasoning-effort {max,high} inserts a max-effort instruction prefix; only valid when thinking is enabled
- --drop-thinking controls whether reasoning content is kept in each turn
## How was this patch tested?
Tested with byte-level alignment against the official encoding_dsv4 script.
See merge request: Ascend/MindSpeed-LLM!4436 | 25 天前 |