File | Last commit | Last updated
[Modify][mindspore] remove temporal patchs and update preprocess_dataset patch

Co-authored-by: weixin_47897441<wuyouqi1@h-partners.com>

# message auto-generated for no-merge-commit merge:
!2100 merge master-0129 into master

[Modify][mindspore] remove temporal patchs and update preprocess_dataset patch

Created-by: weixin_47897441
Commit-by: weixin_47897441
Merged-by: ascend-robot
Description:

## Motivation
To streamline patch management and ensure that models launch normally, unnecessary patches need to be removed and existing patches need to be updated.

## Modification
1. Removed the unnecessary patches for qwen2vl_vit_model.py to clean up redundant code.
2. Updated the patch for the preprocess_dataset function in convert.py to ensure that models launch normally.

## Self-test (Optional)
- Verified that the model launches successfully after the patch adjustments, with no functional abnormalities in dataset preprocessing or model initialization.

![image.png](https://raw.gitcode.com/user-images/assets/7404510/9de18e82-35e3-46fb-ab30-a43f3eac6516/image.png 'image.png')

## BC-breaking (Optional)
This PR only involves patch adjustments for specific files and does not introduce compatibility issues with CANN/torch_npu versions or other dependencies.

## Checklist
**Before PR**:
- [x] The new code needs to comply with the Clean Code specification.
- [x] The PR content is self-checked, the wording is clear, and the writing is standardized.

**After PR**:
- [x] CLA has been signed and all committers have signed the CLA in this PR.
- [x] The ci-pipeline is passed, Code Check is passed.

See merge request: Ascend/MindSpeed-MM!2100

Last updated: 3 months ago

[Docs] Annotation Standardization Rectification

Co-authored-by: LKONE<wanglikai4@huawei.com>

# message auto-generated for no-merge-commit merge:
!2010 merge master into master

[Docs] Annotation Standardization Rectification

Created-by: wanglikai1019
Commit-by: LKONE
Merged-by: ascend-robot
Description:

## Motivation
Rectify the annotations in the code repository in accordance with the annotation specification requirements.

## Modification
1. Replace Chinese annotations with English ones.
2. Organize the environment variable documentation and add explanations in the startup scripts (currently implemented for Qwen3VL and Wan2.2).
3. Add annotations to the public dataset functions and public model classes of Wan2.2 and Qwen3VL.
4. Adjust the annotation format (e.g., indentation depth).

## Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.

## BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.

## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, the wording is clear, and the writing is standardized.

**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.

See merge request: Ascend/MindSpeed-MM!2010

Last updated: 4 months ago

[mindspore] support qwen3vl finetune for megatron backend

Co-authored-by: ffmh<fengminghao2@huawei.com>

# message auto-generated for no-merge-commit merge:
!1785 merge qwen3vl_megatron into master

[mindspore] support qwen3vl finetune for megatron backend

Created-by: ffmh
Commit-by: ffmh
Merged-by: ascend-robot
Description:

## Motivation
Please describe the motivation of this PR and the goal you want to achieve through this PR.

Adapt qwen3vl 8B/30B to the megatron backend.

## Modification
Please briefly describe what modification is made in this PR.

Add layer_spec modifications, and add deepstack-related adaptations to the transformer block/layer of the text module.

## Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.

## BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.

## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, the wording is clear, and the writing is standardized.

**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.

See merge request: Ascend/MindSpeed-MM!1785

Last updated: 6 months ago
