[Feature] fsdp2 qwen3vl 30B support lora finetune
Co-authored-by: LKONE<wanglikai4@huawei.com>
# message auto-generated for no-merge-commit merge:
!2189 merge master into master
[Feature] fsdp2 qwen3vl 30B support lora finetune
Created-by: wanglikai1019
Commit-by: LKONE
Merged-by: ascend-robot
Description: ## Motivation
During gradient recomputation in the backward pass, it is necessary to obtain the gradient function outside this block as the entry point for the backward pass. Therefore, a patch is added to enable the gradient function of the embed_tokens layer, allowing LoRA fine-tuning to enter the gradient recomputation backward pass.
Related history PR: https://gitee.com/ascend/MindSpeed-LLM/pulls/1498
## Modification
1.Add a new LoRA fine-tuning script for 30B models
2.Add parsing code for LoRA parameters
3.Add a LoRA patch for the gradient recomputation scenario
## Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.
## BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!2189
feat(torch): fsdp2 wan2.2 14b t2v support lora finetune
Co-authored-by: LKONE<wanglikai4@huawei.com>
# message auto-generated for no-merge-commit merge:
!2218 merge master into master
feat(torch): fsdp2 wan2.2 14b t2v support lora finetune
Created-by: wanglikai1019
Commit-by: LKONE
Merged-by: ascend-robot
Description: ## Motivation
Please describe the motivation of this PR and the goal you want to achieve through this PR.
## Modification
Please briefly describe what modification is made in this PR.
## Self-test (Optional)
If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached.
## BC-breaking (Optional)
If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR.
## Checklist
**Before PR**:
- [ ] The new code needs to comply with the Clean Code specification.
- [ ] The PR content is self-checked, and the expression can be clear and the writing standardized
**After PR**:
- [ ] CLA has been signed and all committers have signed the CLA in this PR.
- [ ] The ci-pipeline is passed, Code Check is passed.
See merge request: Ascend/MindSpeed-MM!2218