File / Last commit / Last updated
style: pre-commit autofix cleancode (base check)
Co-authored-by: liyingxuan <liyingxuan3@huawei.com>
Merge request !2616 (merge master into master). Created-by: liyx616. Commit-by: liyingxuan. Merged-by: ascend-robot.
See merge request: Ascend/MindSpeed-MM!2616
Last updated: 12 hours ago
[Bugfix] openclaw fix some bugs
Co-authored-by: yangx_sy <sunyang49@huawei.com>
Merge request !2483 (merge fsdp2_fix into master). Created-by: yangx_sy. Commit-by: yangx_sy. Merged-by: ascend-robot.
Description:
## What this PR does / why we need it?
Based on the openclaw scan results, fixes minor bugs in the corresponding pure fsdp2 code.
See merge request: Ascend/MindSpeed-MM!2483
Last updated: 12 days ago
[Feature] support broadcast loading
Co-authored-by: htwang <wanghaitao60@huawei.com>
Merge request !2267 (merge ckpt_optim into master). Created-by: htwang. Commit-by: htwang. Merged-by: ascend-robot.
Description:
## What this PR does / why we need it?
When the weights are large and the number of devices is high, disk I/O becomes the bottleneck during weight loading. Having rank 0 read the weights and then broadcast them effectively reduces I/O.
## Does this PR introduce any user-facing change?
Adds a weight-loading mode in which rank 0 loads the weights and broadcasts them to the other devices.
See merge request: Ascend/MindSpeed-MM!2267
Last updated: 2 months ago
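The rank-0-read-then-broadcast idea described in this merge request can be sketched as follows. This is a minimal illustration, not the code from the PR: `load_with_broadcast`, `FakeWorld`, and `simulate` are hypothetical names, and the process group is faked in a single process so the example runs anywhere. The `broadcast_fn(box, src=0)` call shape is chosen to match `torch.distributed.broadcast_object_list`, which could be passed in on a real cluster.

```python
from typing import Any, Callable, Dict, List, Tuple


def load_with_broadcast(
    path: str,
    rank: int,
    read_fn: Callable[[str], Any],
    broadcast_fn: Callable[..., None],
) -> Any:
    """Read the checkpoint on rank 0 only, then broadcast it to every rank."""
    # Only rank 0 touches the disk; other ranks start with a placeholder.
    box: List[Any] = [read_fn(path)] if rank == 0 else [None]
    # Same call shape as torch.distributed.broadcast_object_list(box, src=0).
    broadcast_fn(box, src=0)
    return box[0]


class FakeWorld:
    """Single-process stand-in for a process group (assumption for the demo)."""

    def __init__(self) -> None:
        self.wire: Any = None  # what rank 0 has "sent"
        self.disk_reads = 0    # how many times the checkpoint was read

    def read(self, path: str) -> Dict[str, List[float]]:
        self.disk_reads += 1
        return {"linear.weight": [1.0, 2.0]}

    def broadcast(self, box: List[Any], src: int = 0) -> None:
        if box[0] is not None:
            self.wire = box[0]   # source rank publishes its object
        else:
            box[0] = self.wire   # other ranks receive it


def simulate(world_size: int) -> Tuple[List[Any], int]:
    """Run every rank in turn and report the states plus the disk-read count."""
    world = FakeWorld()
    states = [
        load_with_broadcast("ckpt.pt", rank, world.read, world.broadcast)
        for rank in range(world_size)
    ]
    return states, world.disk_reads
```

With this layout the number of disk reads stays at one regardless of how many ranks participate, at the cost of extra traffic on the interconnect.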
feat: add HF online-load method
Co-authored-by: BianZheng <bianzheng8@huawei.com>
Merge request !2544 (merge hfnodis into master). Created-by: liuxi_. Commit-by: liuxi_;BianZheng. Merged-by: ascend-robot.
Description:
## What this PR does / why we need it?
![image.png](https://raw.gitcode.com/user-images/assets/7404510/b2844d96-7508-4833-9a45-bc03459bb24e/image.png 'image.png')
## How was this patch tested?
Unit tests; trained the same model with DCP data and with HF data, and checked the loss.
See merge request: Ascend/MindSpeed-MM!2544
Last updated: 5 days ago
[bugfix] Compatibility between lora and meta initialization in fsdp2 backend
Co-authored-by: LKONE <wanglikai4@huawei.com>
Merge request !2499 (merge master into master). Created-by: wanglikai1019. Commit-by: LKONE. Merged-by: ascend-robot.
Description:
## What this PR does / why we need it?
In the fsdp2 backend, the LoRA feature and meta initialization are incompatible in two respects:
(1) LoRA weights are initialized before meta initialization, so they are affected by empty_like and cannot be initialized correctly. The fix calls init_model_weights once more after meta initialization to perform random initialization.
(2) LoRA wrapping happens before the base DCP weights are loaded. After wrapping, the base-weight keys gain the .base_layer prefix and no longer match the keys in the checkpoint. The fix adds a LoRA-specific ModelState that, during weight loading, first removes the .base_layer prefix and then adds it back.
The remaining changes address pre-commit codecheck findings:
(1) Redefining name 'args' from outer scope: in the trainer file, the global variable args in the main function clashes with local variable names used later. The global in main is renamed to arguments to distinguish it from the locals.
(2) String statement has no effect (pointless-string-statement): the """Build optimizer for the model.""" string sat in the middle of the code and was moved.
(3) Attempted relative import beyond top-level package (relative-beyond-top-level): the relative import was changed to an absolute import.
(4) Using open without explicitly specifying an encoding (unspecified-encoding): open was called without an explicit encoding. It now passes utf-8, the common choice on Linux.
(5) Use lazy % formatting in logging functions (logging-fstring-interpolation): logger info calls used {} interpolation, which formats eagerly. They now use lazy %s formatting.
All other changes are formatting only.
## Does this PR introduce any user-facing change?
None.
## How was this patch tested?
Enabled meta initialization and checked that LoRA fine-tuning runs correctly, that the base weights load correctly, and that the loss is normal and not NaN.
See merge request: Ascend/MindSpeed-MM!2499
Last updated: 20 days ago
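The key-matching problem in item (2) of the LoRA fix can be illustrated with plain dictionaries. This is a rough sketch under stated assumptions, not the ModelState added by the PR: `to_checkpoint_key` and `load_base_weights` are hypothetical helpers, and the `.lora_` naming used to recognize adapter weights is an assumption about how the wrapped model names its parameters.

```python
from typing import Any, Dict, Iterable

BASE_LAYER = ".base_layer."


def to_checkpoint_key(model_key: str) -> str:
    """Drop the '.base_layer' segment so the key matches the base checkpoint."""
    return model_key.replace(BASE_LAYER, ".")


def load_base_weights(
    model_keys: Iterable[str], checkpoint: Dict[str, Any]
) -> Dict[str, Any]:
    """Map checkpoint tensors onto the keys of a LoRA-wrapped model.

    The lookup uses the stripped key; the result is stored under the wrapped
    key, which effectively adds '.base_layer' back.
    """
    loaded: Dict[str, Any] = {}
    for model_key in model_keys:
        # Assumption: adapter weights contain '.lora_' and are absent from
        # the base checkpoint, so they are left for random initialization.
        if ".lora_" in model_key:
            continue
        ckpt_key = to_checkpoint_key(model_key)
        if ckpt_key in checkpoint:
            loaded[model_key] = checkpoint[ckpt_key]
    return loaded


# Toy data: one wrapped projection, its adapter, and one unwrapped norm layer.
wrapped_keys = [
    "blocks.0.q_proj.base_layer.weight",
    "blocks.0.q_proj.lora_A.weight",
    "blocks.0.norm.weight",
]
base_checkpoint = {
    "blocks.0.q_proj.weight": "W_q",
    "blocks.0.norm.weight": "W_norm",
}
result = load_base_weights(wrapped_keys, base_checkpoint)
```

Keys of unwrapped modules pass through unchanged, so the same routine can serve both wrapped and unwrapped layers.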