| modify npu_multi_head_latent_attention atb op. Co-authored-by: weiguihua_gitee <984323595@qq.com>. Message auto-generated for no-merge-commit merge: merge pr_3146_1757900574166 into master. Created-by: weiguihuagitee. Commit-by: weiguihuagitee; weiguihua_gitee. Merged-by: ascend-robot. Description: add npu_multi_head_latent_attention_with_lse atb op. **What does this PR do / why do we need it**: To support the CP and SP (context-parallel and sequence-parallel) scenarios, MLA (multi-head latent attention) must also output the LSE (log-sum-exp) of the attention scores. See merge request: Ascend/op-plugin!3169 | 9 months ago |
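
Note for reviewers: a minimal sketch of why the LSE output matters when keys and values are split across ranks. It is plain Python, not the ATB op or its signature; `attention_with_lse` and `merge_partials` are hypothetical helper names, and the usual 1/sqrt(d) score scaling is left out for brevity. Each shard returns its own softmax-normalized output plus the log-sum-exp of its scores, and the LSE values are what allow the shards to be re-weighted into the exact full-attention result.

```python
import math

def attention_with_lse(q, keys, values):
    """Single-query softmax attention over one shard; returns (output, lse)."""
    scores = [sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    m = max(scores)  # subtract the max for numerical stability
    exp_scores = [math.exp(s - m) for s in scores]
    denom = sum(exp_scores)
    dim = len(values[0])
    out = [sum(e * v[d] for e, v in zip(exp_scores, values)) / denom
           for d in range(dim)]
    lse = m + math.log(denom)  # log of the sum of exp(scores) on this shard
    return out, lse

def merge_partials(partials):
    """Combine per-shard (output, lse) pairs into the full-attention output."""
    lses = [lse for _, lse in partials]
    m = max(lses)
    global_lse = m + math.log(sum(math.exp(l - m) for l in lses))
    dim = len(partials[0][0])
    merged = [0.0] * dim
    for out, lse in partials:
        weight = math.exp(lse - global_lse)  # this shard's share of the softmax mass
        for d in range(dim):
            merged[d] += weight * out[d]
    return merged, global_lse

# Illustrative data only: one query, four key/value pairs split into two shards.
q = [0.5, -1.0]
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 2.0]]
values = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]

full_out, full_lse = attention_with_lse(q, keys, values)
shard_a = attention_with_lse(q, keys[:2], values[:2])
shard_b = attention_with_lse(q, keys[2:], values[2:])
merged_out, merged_lse = merge_partials([shard_a, shard_b])
```

Without the per-shard LSE, the shard outputs could only be averaged, which does not reproduce the softmax over the full key set.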