文件最后提交记录最后更新时间
feat(torch): Squash merge fsdp2_dev into master. Co-authored-by: zs-derrick1<1434012475@qq.com> # message auto-generated for no-merge-commit merge: !2223 merge master into master feat(torch): Squash merge fsdp2_dev into master. Created-by: zs-derrick1 Commit-by: zs-derrick1 Merged-by: ascend-robot Description: ## Motivation Squash merge fsdp2_dev into master. ## Modification fsdp2_dev分支合并至master ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/MindSpeed-MM!22233 个月前
Adapt fused operator in Qwen3VL MoE forward Co-authored-by: pangwendong<pangwendong@huawei.com> # message auto-generated for no-merge-commit merge: !2286 merge qwen3vl_fused into master Adapt fused operator in Qwen3VL MoE forward Created-by: WendongPang Commit-by: pangwendong Merged-by: ascend-robot Description: ## What this PR does / why we need it? https://gitcode.com/Ascend/MindSpeed-MM/issues/276 ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Enable deterministic computation (use_deter_comp: true) and train under default configuration and fusion operator respectively. The relative error of loss comparison is less than 10 ^ -5, and the relative error of gradient comparison is less than 10 ^ -4. See merge request: Ascend/MindSpeed-MM!22862 个月前