MindSpeed-MM/mindspeed_mm/mindspore/utils · Ascend/MindSpeed-MM - AtomGit

Commit 8549f54b, created on November 21, 2025

File	Last commit	Last updated
transformer_model_config.py	[mindspore] support qwen3vl finetune for megatron backend. Co-authored-by: ffmh <fengminghao2@huawei.com>. Merge request !1785: merge qwen3vl_megatron into master. Created-by: ffmh. Commit-by: ffmh. Merged-by: ascend-robot. Motivation: adapt qwen3vl 8B/30B to the megatron backend. Modification: modify layer_spec and add deepstack-related adaptations to the text module's transformer block/layer. See merge request: Ascend/MindSpeed-MM!1785	6 months ago