文件最后提交记录最后更新时间
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261613 小时前
[feature]support moe expert merge weight convert from hf to dcp or vice versa Co-authored-by: pjgao<gaopengju3@huawei.com> # message auto-generated for no-merge-commit merge: !1752 merge master into master [feature]support moe expert merge weight convert from hf to dcp or vice versa Created-by: PIPIXIU Commit-by: pjgao Merged-by: ascend-robot Description: ## Motivation InternVL3.5使用MOE专家合并的方式需要权重转换,当前仓上是两段转换,先将原仓safetensors转换成合并专家后的safetensors,再转换成torch dcp,因此本PR新增转换接口mm-convert ExpertMergeDcpConverter hf_to_dcp /mm-convert ExpertMergeDcpConverter dcp_to_hf ## Modification 1. 增加转换类ExpertMergeDcpConverter 2. 抽象Commandable管理所有的命令行接口,提供add_command接口,允许增加函数到命令行中 ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!17526 个月前
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261613 小时前
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261613 小时前
!1445 [Bugfix]Modify the file permission control in the weight conversion scenario Merge pull request !1445 from zhangxubin/master 9 个月前
style: pre-commit autofix cleancode (base check) Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !2616 merge master into master style: pre-commit autofix cleancode (base check) Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!261613 小时前
[feature] merge lora-fsdp2 backend safetensors to huggingface weight Co-authored-by: qiaoxiaomin<qiaoxiaomin@huawei.com> # message auto-generated for no-merge-commit merge: !2436 merge master into master [feature] merge lora-fsdp2 backend safetensors to huggingface weight Created-by: Miss_min Commit-by: qiaoxiaomin Merged-by: ascend-robot Description: ## What this PR does / why we need it? 提供合并fadp2 后端lora权重到huggingface基础权重 https://gitcode.com/Ascend/MindSpeed-MM/issues/176 ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-MM!24361 个月前
[Feature] support Qwen3VL ckpt convert Co-authored-by: liyingxuan<liyingxuan3@huawei.com> # message auto-generated for no-merge-commit merge: !1610 merge master into master [Feature] support Qwen3VL ckpt convert Created-by: liyx616 Commit-by: liyingxuan Merged-by: ascend-robot Description: ## Motivation 支持Qwen3VL FSDP2训练后保存的torch dcp权重转换成hf权重,直接使用transformers推理 ## Modification dcp权重转换成huggingface权重的临时转换脚本,之后会将其中的关键函数接入Converter中 权重转换目录下有一个types.py文件,和python的标准库重名,会有import时会有循环import的风险,改名为mm_types.py transformers_model中的初始化路径改成从model.json的init_from_hf_path, args.load用于加载断点续训的路径 启动脚本的ckpt-format改为torch_dcp ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!16107 个月前
[feature]support moe expert merge weight convert from hf to dcp or vice versa Co-authored-by: pjgao<gaopengju3@huawei.com> # message auto-generated for no-merge-commit merge: !1752 merge master into master [feature]support moe expert merge weight convert from hf to dcp or vice versa Created-by: PIPIXIU Commit-by: pjgao Merged-by: ascend-robot Description: ## Motivation InternVL3.5使用MOE专家合并的方式需要权重转换,当前仓上是两段转换,先将原仓safetensors转换成合并专家后的safetensors,再转换成torch dcp,因此本PR新增转换接口mm-convert ExpertMergeDcpConverter hf_to_dcp /mm-convert ExpertMergeDcpConverter dcp_to_hf ## Modification 1. 增加转换类ExpertMergeDcpConverter 2. 抽象Commandable管理所有的命令行接口,提供add_command接口,允许增加函数到命令行中 ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!17526 个月前