| chore(fsdp2): develop longcat-flash-lite model in fsdp2
Co-authored-by: guihaowen666 <guihaowen@huawei.com>
# message auto-generated for no-merge-commit merge:
!4344 merge br_master_longcat_flash_lite_fsdp2 into master
chore(fsdp2): develop longcat-flash-lite model in fsdp2
Created-by: guihaowen666
Commit-by: guihaowen666
Merged-by: ascend-robot
Description:
## What this PR does / why we need it?
Add support for the longcat-flash-lite model in fsdp2.
## Does this PR introduce any user-facing change?
No. This is new model development with no user-facing change.
## How was this patch tested?
Ran the inference task and checked that the model can carry on a normal dialogue.
See merge request: Ascend/MindSpeed-LLM!4344 | 10 hours ago |