| modify npu_multi_head_latent_attention atb op
Co-authored-by: weiguihua_gitee<984323595@qq.com>
# message auto-generated for no-merge-commit merge:
merge pr_3146_1757900574166 into master
modify npu_multi_head_latent_attention atb op
Created-by: weiguihuagitee
Commit-by: weiguihuagitee;weiguihua_gitee
Merged-by: ascend-robot
Description: <!-- Thanks for sending a pull request!
-->
add npu_multi_head_latent_attention_with_lse atb op
**What type of PR is this?**
> Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line:
>
> /kind bug
> /kind task
> /kind feature
**What does this PR do / why do we need it**:
To support the cp and sp scenario, it is necessary to add the output of lse in MLA.
**Special notes for your reviewers**:
See merge request: Ascend/op-plugin!3169 | 8 个月前 |