MindSpeed-MM/mindspeed_mm/models/transformers · Ascend/MindSpeed-MM - AtomGit

ascend-robot: [Modify]Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion

b7d35d83, created on December 5, 2025

File	Last commit	Last updated
qwen3vl	[Modify] cherry-pick master to 2.3.0 for q4 Co-authored-by: liyx616<liyingxuan3@huawei.com> Co-authored-by: yangx_sy<sunyang49@huawei.com> # message auto-generated for no-merge-commit merge: !1889 merge mm_230 into 2.3.0 [Modify] cherry-pick master to 2.3.0 for q4 Created-by: yangx_sy Commit-by: yangx_sy;liyx616 Merged-by: ascend-robot Description: ## Motivation Cherry-pick several commits from the master branch to the 2.3.0 commercial release branch, mainly for Q4 requirements. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized. After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!1889	5 months ago
base_model.py	[Modify] cherry-pick master to 2.3.0 for q4 Co-authored-by: liyx616<liyingxuan3@huawei.com> Co-authored-by: yangx_sy<sunyang49@huawei.com> # message auto-generated for no-merge-commit merge: !1889 merge mm_230 into 2.3.0 Created-by: yangx_sy Commit-by: yangx_sy;liyx616 Merged-by: ascend-robot Description: ## Motivation Cherry-pick several commits from the master branch to the 2.3.0 commercial release branch, mainly for Q4 requirements. See merge request: Ascend/MindSpeed-MM!1889	5 months ago
configuration_hunyuan.py	[Modify] Refactor the fsdp2 model Co-authored-by: htwang<wanghaitao60@huawei.com> # message auto-generated for no-merge-commit merge: !1793 merge master into master Created-by: htwang Commit-by: htwang Merged-by: ascend-robot Description: ## Motivation 1. Change modelzoo to modelhub. 2. Change hf_src to transformers. ## Modification 1. Change modelzoo to modelhub. 2. Change hf_src to transformers. See merge request: Ascend/MindSpeed-MM!1793	6 months ago
hunyuan_image_3_pipeline.py	[Modify] Refactor the fsdp2 model Co-authored-by: htwang<wanghaitao60@huawei.com> # message auto-generated for no-merge-commit merge: !1793 merge master into master Created-by: htwang Commit-by: htwang Merged-by: ascend-robot Description: ## Motivation 1. Change modelzoo to modelhub. 2. Change hf_src to transformers. ## Modification 1. Change modelzoo to modelhub. 2. Change hf_src to transformers. See merge request: Ascend/MindSpeed-MM!1793	6 months ago
internvl3_5.py	[Modify] Change the variable ASCEND_LAUNCH_BLOCKING in internvl3.5 to its default value Co-authored-by: meng-coding<wumengjie6@huawei.com> # message auto-generated for no-merge-commit merge: !1848 merge 2.3.0 into 2.3.0 Created-by: meng-coding Commit-by: meng-coding Merged-by: ascend-robot Description: ## Motivation Change the variable ASCEND_LAUNCH_BLOCKING in internvl3.5 to its default value. ## Modification 1. In the internvl3.5 launch script, restore the ASCEND_LAUNCH_BLOCKING variable to its default value. 2. Add a synchronization operation to each layer, for memory reuse. See merge request: Ascend/MindSpeed-MM!1848	5 months ago
modelhub.py	[Modify] Refactor the fsdp2 model Co-authored-by: htwang<wanghaitao60@huawei.com> # message auto-generated for no-merge-commit merge: !1793 merge master into master Created-by: htwang Commit-by: htwang Merged-by: ascend-robot Description: ## Motivation 1. Change modelzoo to modelhub. 2. Change hf_src to transformers. ## Modification 1. Change modelzoo to modelhub. 2. Change hf_src to transformers. See merge request: Ascend/MindSpeed-MM!1793	6 months ago
qwen3omni.py	[Modify]Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion Co-authored-by: meng-coding<wumengjie6@huawei.com> Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com> # message auto-generated for no-merge-commit merge: !1898 merge pick_master_ckpt_to_230 into 2.3.0 Created-by: yaoyaoxu Commit-by: yaoyaoxu;meng-coding Merged-by: ascend-robot Description: ## Motivation Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion ## Modification Performance Optimization for Qwen3-Omni Thinker MoE Expert Weight Conversion See merge request: Ascend/MindSpeed-MM!1898	5 months ago