文件最后提交记录最后更新时间
[master][Modify]Add _gather patch support for InternVL3 Co-authored-by: iquoyuw<wuyouqi1@h-partners.com> # message auto-generated for no-merge-commit merge: !1697 merge internvl3-7b into master [master][Modify]Add _gather patch support for InternVL3 Created-by: weixin_47897441 Commit-by: iquoyuw Merged-by: ascend-robot Description: ## Motivation Add _gather patch support for InternVL3 model. ## Modification This modification extends the device type check to include the "Ascend" device type in addition to "cuda" and "npu". ## Self-test (Optional) This patch enables accurate training of Internvl3 on the MS framework by aligning its 100-iteration, 16-bit loss/gradient with the PT framework, and no new accuracy issues are caused. ![image.png](https://raw.gitcode.com/user-images/assets/7404510/05a0c5a2-05b1-4a08-8359-68ae68750cfc/image.png 'image.png') ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!16976 个月前
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261613 小时前