文件最后提交记录最后更新时间
Add npu_all_gather_base_mm testcase Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3339 merge allgather into master Add npu_all_gather_base_mm testcase Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: 本地验证通过: ![image.png](https://raw.gitcode.com/user-images/assets/7403085/a54d8e40-6542-4484-b006-eeb700b306ff/image.png 'image.png') See merge request: Ascend/op-plugin!33397 个月前
!2448 ADD ALLTOALLVGMM AND GMMALLTOALLV Merge pull request !2448 from 梅文瑄/master 1 年前
!2185 Added the atb operator monitoring case. Merge pull request !2185 from 史佳馨/master 1 年前
OP hash supports thread-level Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3867 merge 7.3.0 into 7.3.0 OP hash supports thread-level Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: op仓的atb op复用采用线程级单例。 当前op仓的atb缓存设计为进程级别的单列,多线程场景存在op复用冲突,atb底层的setup和exector不支持相同的op在不同线程。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!38675 个月前
!2229 The self-attention PREFIX_ENCODER and pagedattention_qlens atb operators are added. Merge pull request !2229 from 史佳馨/master 1 年前
!2185 Added the atb operator monitoring case. Merge pull request !2185 from 史佳馨/master 1 年前
!2578 Added the alibi scenario of the atb pa operator, alibi scenario of the… Merge pull request !2578 from 史佳馨/master 11 个月前
!2175 Add group_topk. Merge pull request !2175 from 王夏夏/master 1 年前
!2733 Fix cpu tensor deadlocks Merge pull request !2733 from wgb/atb 10 个月前
!2632 add atb mla and prefix encoder ut Merge pull request !2632 from wgb/atb 11 个月前
OP hash supports thread-level Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3867 merge 7.3.0 into 7.3.0 OP hash supports thread-level Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: op仓的atb op复用采用线程级单例。 当前op仓的atb缓存设计为进程级别的单列,多线程场景存在op复用冲突,atb底层的setup和exector不支持相同的op在不同线程。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!38675 个月前
OP hash supports thread-level Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3867 merge 7.3.0 into 7.3.0 OP hash supports thread-level Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: op仓的atb op复用采用线程级单例。 当前op仓的atb缓存设计为进程级别的单列,多线程场景存在op复用冲突,atb底层的setup和exector不支持相同的op在不同线程。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!38675 个月前
!2646 Add atb meta and doc feature and mlapo adaption Merge pull request !2646 from wgb/atb 11 个月前
PA surpport aclgraph Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: merge atb into master PA surpport aclgraph Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: 1. ATB,C++接入方式下发方式新增workpspace支持外部传入,以支持aclgraph的update特性。 2. 新增特性使能atb pa接口,支持aclgraph **Special notes for your reviewers**: See merge request: Ascend/op-plugin!32048 个月前
!2456 modify atb monitoring case Merge pull request !2456 from 史佳馨/master 1 年前
!2456 modify atb monitoring case Merge pull request !2456 from 史佳馨/master 1 年前
!2373 Fix libatb.so not found in some docker Merge pull request !2373 from wgb/bugfix 1 年前
PA surpport aclgraph Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: merge atb into master PA surpport aclgraph Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: 1. ATB,C++接入方式下发方式新增workpspace支持外部传入,以支持aclgraph的update特性。 2. 新增特性使能atb pa接口,支持aclgraph **Special notes for your reviewers**: See merge request: Ascend/op-plugin!32048 个月前
Fix potential integer overflow issues in npu_paged_cache_load Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3413 merge bugfix into master Fix potential integer overflow issues in npu_paged_cache_load Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!34136 个月前
!2560 Atb support aclgraph Merge pull request !2560 from wgb/atb_nz 11 个月前
OP hash supports thread-level Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3867 merge 7.3.0 into 7.3.0 OP hash supports thread-level Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: op仓的atb op复用采用线程级单例。 当前op仓的atb缓存设计为进程级别的单列,多线程场景存在op复用冲突,atb底层的setup和exector不支持相同的op在不同线程。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!38675 个月前
OP hash supports thread-level Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: !3867 merge 7.3.0 into 7.3.0 OP hash supports thread-level Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: op仓的atb op复用采用线程级单例。 当前op仓的atb缓存设计为进程级别的单列,多线程场景存在op复用冲突,atb底层的setup和exector不支持相同的op在不同线程。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!38675 个月前
!2625 The ut care of npu_ring_mla and npu_fused_add_topk_div is added. Merge pull request !2625 from 史佳馨/master 11 个月前
!2617 ATB support torch.compile with fullgraph Merge pull request !2617 from wgb/atb3 11 个月前
!2632 add atb mla and prefix encoder ut Merge pull request !2632 from wgb/atb 11 个月前
适配支持两类算子:ffn_worker_scheduler_/ffn_worker_scheduler、 attention_worker_scheduler_/attention_worker_scheduler Co-authored-by: panweicheng<panweicheng@huawei.com> # message auto-generated for no-merge-commit merge: !3397 merge master into master 适配支持两类算子:ffn_worker_scheduler_/ffn_worker_scheduler、 attention_worker_scheduler_/attention_worker_scheduler Created-by: pwc10490 Commit-by: panweicheng Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > /kind feature **What does this PR do / why do we need it**: 适配支持两类算子: torch_npu.attention_worker_scheduler torch_npu.attention_worker_scheduler_(inplace算子) torch_npu.ffn_worker_scheduler torch_npu.ffn_worker_scheduler_(inplace算子) **Special notes for your reviewers**: See merge request: Ascend/op-plugin!33977 个月前
barrier Co-authored-by: davon<liuwenda4@huawei.com> # message auto-generated for no-merge-commit merge: !3372 merge master into master barrier Created-by: Davon14272 Commit-by: davon Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind feature **What does this PR do / why do we need it**: _npu_distribute_barrier adds two optional inputs: time_out and elastic_info **Special notes for your reviewers**: See merge request: Ascend/op-plugin!33727 个月前
适配支持两类算子:ffn_worker_scheduler_/ffn_worker_scheduler、 attention_worker_scheduler_/attention_worker_scheduler Co-authored-by: panweicheng<panweicheng@huawei.com> # message auto-generated for no-merge-commit merge: !3397 merge master into master 适配支持两类算子:ffn_worker_scheduler_/ffn_worker_scheduler、 attention_worker_scheduler_/attention_worker_scheduler Created-by: pwc10490 Commit-by: panweicheng Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > /kind feature **What does this PR do / why do we need it**: 适配支持两类算子: torch_npu.attention_worker_scheduler torch_npu.attention_worker_scheduler_(inplace算子) torch_npu.ffn_worker_scheduler torch_npu.ffn_worker_scheduler_(inplace算子) **Special notes for your reviewers**: See merge request: Ascend/op-plugin!33977 个月前
!3072 npu_fused_add_topk_div benchmark function modification Merge pull request !3072 from 史佳馨/master 9 个月前
!2448 ADD ALLTOALLVGMM AND GMMALLTOALLV Merge pull request !2448 from 梅文瑄/master 1 年前
!1782 move m to npu_op to struct codegen Merge pull request !1782 from 王广斌/new_struct3 1 年前
!2385 add HansEncode ans HansDeocde op api Merge pull request !2385 from Kingsleyandher/master 1 年前
!2100 fa bugfix Merge pull request !2100 from realmadrid1016/fa_bugfix 1 年前
!1200 Fix 910B skip ut use decorator Merge pull request !1200 from 王广斌/master_cherry 2 年前
!2711 Modify MlaProlog operator UT Merge pull request !2711 from 刘宇/master 11 个月前
!2711 Modify MlaProlog operator UT Merge pull request !2711 from 刘宇/master 11 个月前
mlaprologV3 update from master to 7.3.0 Co-authored-by: Le_666<yuanle2@huawei.com> # message auto-generated for no-merge-commit merge: !3706 merge 7.3.0 into 7.3.0 mlaprologV3 update from master to 7.3.0 Created-by: Le_666 Commit-by: Le_666 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: 主线对MlaPrologV3算子接口信息进行了修改,现已修改完毕,主线运行正常。目前根据需求将主线的稳定代码,同步至商分分支 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!37065 个月前
!1988 Fix some UTs on atlas_a2 part 3 Merge pull request !1988 from 张向龙3/fix_910b_ut_3 1 年前
!1988 Fix some UTs on atlas_a2 part 3 Merge pull request !1988 from 张向龙3/fix_910b_ut_3 1 年前
npu_mm_reduce_scatter_base support quant and comm_mode aiv Co-authored-by: wang-guangbin<wgb_strive@163.com> # message auto-generated for no-merge-commit merge: merge mm_rs into master npu_mm_reduce_scatter_base support quant and comm_mode aiv Created-by: wang-guangbin Commit-by: wang-guangbin Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: 本地验证通过 ![image.png](https://raw.gitcode.com/user-images/assets/7403085/3b23aa82-4d92-4ae0-9617-b48ea4f056e1/image.png 'image.png') See merge request: Ascend/op-plugin!31948 个月前
!2838 revert change of moe_distribute_dispatch/combine v1 interface Merge pull request !2838 from sincatter/master_dis_v1 10 个月前
mc2 dispatch&combine v2 add aclnnV4 for 7.3.0 Co-authored-by: yyoean<1114146082@qq.com> # message auto-generated for no-merge-commit merge: !3539 merge 7.3.0 into 7.3.0 mc2 dispatch&combine v2 add aclnnV4 for 7.3.0 Created-by: yyoean Commit-by: yyoean Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: https://gitcode.com/Ascend/op-plugin/issues/25 See merge request: Ascend/op-plugin!35396 个月前
!3062 DispatchV3/CombineV3/CombineaddrmsnormV2 update Merge pull request !3062 from Oblivion/master 8 个月前
!3062 DispatchV3/CombineV3/CombineaddrmsnormV2 update Merge pull request !3062 from Oblivion/master 8 个月前
!2838 revert change of moe_distribute_dispatch/combine v1 interface Merge pull request !2838 from sincatter/master_dis_v1 10 个月前
mc2 dispatch&combine v2 add aclnnV4 for 7.3.0 Co-authored-by: yyoean<1114146082@qq.com> # message auto-generated for no-merge-commit merge: !3539 merge 7.3.0 into 7.3.0 mc2 dispatch&combine v2 add aclnnV4 for 7.3.0 Created-by: yyoean Commit-by: yyoean Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: https://gitcode.com/Ascend/op-plugin/issues/25 See merge request: Ascend/op-plugin!35396 个月前
!3062 DispatchV3/CombineV3/CombineaddrmsnormV2 update Merge pull request !3062 from Oblivion/master 8 个月前
!2954 npu_moe_eplb_update_expert PTA Merge pull request !2954 from Oblivion/master 9 个月前
!1502 AddLayerNorm return FP32 when input different dtype Merge pull request !1502 from 何曾庚/hzg_mix 2 年前
!1964 Fix some UTs on atlas_a2 part 2 Merge pull request !1964 from 张向龙3/fix_910b_ut_2 1 年前
!1200 Fix 910B skip ut use decorator Merge pull request !1200 from 王广斌/master_cherry 2 年前
!2438 add new add_rms_norm_cast Merge pull request !2438 from liuyun/master 1 年前
Add handling logic for y2Out shape. Co-authored-by: 花无懿<liuxuehui4@huawei.com> # message auto-generated for no-merge-commit merge: !3771 merge 7.3.0 into 7.3.0 Add handling logic for y2Out shape. Created-by: huawuyi Commit-by: 花无懿 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!37715 个月前
npu_add_rms_norm_quant_v2 Co-authored-by: LG<luguang6@huawei.com> # message auto-generated for no-merge-commit merge: merge add_rms_norm_quant_v2 into master npu_add_rms_norm_quant_v2 Created-by: Lu_G Commit-by: LG Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > /kind feature **What does this PR do / why do we need it**: 修改npu_add_rms_norm_quant接口,增加beta入参,接入aclnnAddRMSNormQuantV2算子 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!31728 个月前
!1935 Fix some UTs on 910B Merge pull request !1935 from 张向龙3/fix_ut_910b 1 年前
!3031 add AdvanceStep to support speculative inference Merge pull request !3031 from 陈赵旻熠/AdvanceStep 8 个月前
support tnd format Co-authored-by: ghost<zhengwenbang@huawei.com> # message auto-generated for no-merge-commit merge: !3355 merge master into master support tnd format Created-by: weixin_44564637 Commit-by: ghost Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!33557 个月前
Add npu_attention_update interface Co-authored-by: huawuyi<liuxuehui4@huawei.com> # message auto-generated for no-merge-commit merge: merge master into master Add npu_attention_update interface Created-by: huawuyi Commit-by: huawuyi Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!31668 个月前
Add AttentionWorkerCombine PTA Co-authored-by: weixin_36491311<lingwanxin@huawei.com> # message auto-generated for no-merge-commit merge: merge master into master Add AttentionWorkerCombine PTA Created-by: weixin_36491311 Commit-by: weixin_36491311 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!31768 个月前
!2335 Add npu_attn_softmax. Merge pull request !2335 from 王夏夏/master 1 年前
!2454 remove ut skip Merge pull request !2454 from 沈珈靓/master 1 年前
add npu_clipped_swiglu Co-authored-by: jzj007<jiangzhijie9@huawei.com> # message auto-generated for no-merge-commit merge: !3527 merge 7.3.0CS into 7.3.0 add npu_clipped_swiglu Created-by: jzj007 Commit-by: jzj007 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind feature **What does this PR do / why do we need it**: 新增npu_clipped_swiglu接口 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!35276 个月前
!1964 Fix some UTs on atlas_a2 part 2 Merge pull request !1964 from 张向龙3/fix_910b_ut_2 1 年前
!2150 [Feature] Add npu_cross_entropy_loss. Merge pull request !2150 from 刘嘉巍/master 1 年前
!1200 Fix 910B skip ut use decorator Merge pull request !1200 from 王广斌/master_cherry 2 年前
!1200 Fix 910B skip ut use decorator Merge pull request !1200 from 王广斌/master_cherry 2 年前
!3111 add swiglu_mode,clamp_limit,glu_alpha,glu_bias attr for DequantSwigluQuant Merge pull request !3111 from ghost/master 8 个月前
!1600 add opapi of dropout_gen_mask Merge pull request !1600 from 赖长铃/custom_dropout_aclnn 1 年前
!3119 [Feat] Add npu_dynamic_block_quant docs and meta Merge pull request !3119 from zhangqiongwen/master_npu_dynamic_block_quant_doc 8 个月前
!1973 动态量化算子添加支持MOE和int4 Merge pull request !1973 from Tangmenhao/plugin_1110 1 年前
!1973 动态量化算子添加支持MOE和int4 Merge pull request !1973 from Tangmenhao/plugin_1110 1 年前
!1953 Prevent lambda input from being negative Merge pull request !1953 from zzx/master 1 年前
!2075 ffn ut update Merge pull request !2075 from zqh/1231 1 年前
add FfnWorkerBatching Co-authored-by: 安冬<andong2@huawei.com> # message auto-generated for no-merge-commit merge: merge master into master add FfnWorkerBatching Created-by: andong_hw Commit-by: 安冬 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!31798 个月前
remove keep_prob cast Co-authored-by: caihualilili<liangfuzhan@h-partners.com> # message auto-generated for no-merge-commit merge: merge keep_prob_remove_cast into master remove keep_prob cast Created-by: caihualilili Commit-by: caihualilili Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!32007 个月前
remove keep_prob cast Co-authored-by: caihualilili<liangfuzhan@h-partners.com> # message auto-generated for no-merge-commit merge: merge keep_prob_remove_cast into master remove keep_prob cast Created-by: caihualilili Commit-by: caihualilili Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!32007 个月前
[feat]npu_fusion_attention_v2 adapt for fav5 Co-authored-by: zqwenn<zhangqiongwen@huawei.com> # message auto-generated for no-merge-commit merge: !3522 merge 7.3.0_fav5_adpt into 7.3.0 [feat]npu_fusion_attention_v2 adapt for fav5 Created-by: zqwenn Commit-by: zqwenn Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: npu_fusion_attention_v2新增softmax_layout和sink入参 新增场景测试用例下一个pr提交 See merge request: Ascend/op-plugin!35226 个月前
remove keep_prob cast Co-authored-by: caihualilili<liangfuzhan@h-partners.com> # message auto-generated for no-merge-commit merge: merge keep_prob_remove_cast into master remove keep_prob cast Created-by: caihualilili Commit-by: caihualilili Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!32007 个月前
PA模式QKV D不等长场景下InferShape适配 Co-authored-by: cgagaboy<zengxiansheng1@huawei.com> # message auto-generated for no-merge-commit merge: !3346 merge master into master PA模式QKV D不等长场景下InferShape适配 Created-by: cgagaboy Commit-by: cgagaboy Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!33467 个月前
!3026 fused_infer_attention_score_v2算子支持sink功能,增加sink入参 Merge pull request !3026 from 邢文锦/master 8 个月前
support npu_fused_infer_attention_score_v2 tiling update in NPUGraph Co-authored-by: fengye<vinsmoke.feng@huawei.com> # message auto-generated for no-merge-commit merge: !3925 merge 7.3.0 into 7.3.0 support npu_fused_infer_attention_score_v2 tiling update in NPUGraph Created-by: fengye0903 Commit-by: fengye Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: #@unittest.skip( # "Skipping due to outdated CANN version; please update CANN to the latest version and remove this skip") #@SupportedDevices(['Ascend910B']) def test_npu_fused_infer_attention_score_v2_ifa_return_lse_return_softmax_lse_true_with_workspace(self, device="npu"): query = torch.randn(1, 32, 1, 128, dtype=torch.float16).npu() key = torch.randn(1, 32, 2048, 128, dtype=torch.float16).npu() value = torch.randn(1, 32, 2048, 128, dtype=torch.float16).npu() head_dim = 128 return_softmax_lse = True supported_output, lse_out = self.supported_op_exec(query, key, value, head_dim, 1, 32, 1, return_softmax_lse) custom_output = self.custom_op_exec_with_workspace(query, key, value, head_dim, True) attention_output = custom_output[0] softmaxlse_output = custom_output[1] if __name__ == "__main__": run_tests() [root@devserver-hps-feb34b1c-00033 f00941739]# python ./op-plugin/test/test_custom_ops/test_npu_fused_infer_attention_score_v2_with_workspace.py ss. ---------------------------------------------------------------------- Ran 3 tests in 2.006s OK (skipped=2) **Special notes for your reviewers**: See merge request: Ascend/op-plugin!39255 个月前
add _npu_fused_infer_attention_score_infer_output Co-authored-by: yanpengquan<yanpengquan@huawei.com> # message auto-generated for no-merge-commit merge: !3930 merge 7.3.0_fused into 7.3.0 add _npu_fused_infer_attention_score_infer_output Created-by: wangchao430 Commit-by: yanpengquan Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: ![image.png](https://raw.gitcode.com/user-images/assets/7403085/52a37a39-91ad-4342-924f-b9e3f8f6e9cb/image.png 'image.png') See merge request: Ascend/op-plugin!39305 个月前
!3085 [Feat] Add npu_gather_sparse_index_backward adpt Merge pull request !3085 from zhangqiongwen/master_npu_gather_sparse_index_backward 8 个月前
!1965 GeluV2/GeluGradV2 PTA接口与入图功能代码修改 Merge pull request !1965 from 禹广飞/geluv2_pta 1 年前
!1965 GeluV2/GeluGradV2 PTA接口与入图功能代码修改 Merge pull request !1965 from 禹广飞/geluv2_pta 1 年前
Added support for the npu_gelu_mul operator. Co-authored-by: shi-jiaxin9<shijiaxin10@h-partners.com> # message auto-generated for no-merge-commit merge: !3418 merge master into master Added support for the npu_gelu_mul operator. Created-by: shi-jiaxin9 Commit-by: shi-jiaxin9 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind feature **What does this PR do / why do we need it**: Added support for the npu_gelu_mul operator. **Special notes for your reviewers**: See merge request: Ascend/op-plugin!34186 个月前
!2208 add npu_gemma_rms_norm Merge pull request !2208 from liuyun/master 1 年前
!2125 add group_norm_swish and group_norm_swish_grad Merge pull request !2125 from 詹昊/groupnormswish 1 年前
!1839 add new op group_quant Merge pull request !1839 from 田野/dev_groupquant 1 年前
npu_quantize接口input数据格式支持NZ Co-authored-by: wkp<1289706727@qq.com> # message auto-generated for no-merge-commit merge: merge master-quantize into master npu_quantize接口input数据格式支持NZ Created-by: kunpengW Commit-by: wkp Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: ![image.png](https://raw.gitcode.com/user-images/assets/7403085/0984ff44-7fd0-4bd0-aac1-f621b7a6a750/image.png 'image.png') See merge request: Ascend/op-plugin!32198 个月前
update meta func Co-authored-by: Zichun Ye<zichun.ye@huawei.com> # message auto-generated for no-merge-commit merge: !3791 merge meta_update_730 into 7.3.0 update meta func Created-by: zichun_ye Commit-by: Zichun Ye Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!37915 个月前
GMMFR adds support for cumsum in groulist and none option in shareInput. Co-authored-by: lilening<lilening@huawei.com> # message auto-generated for no-merge-commit merge: !3581 merge 7.3.0 into 7.3.0 GMMFR adds support for cumsum in groulist and none option in shareInput. Created-by: lilening Commit-by: lilening Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind feature **What does this PR do / why do we need it**: GMMFR adds support for cumsum in groulist and none option in shareInput. **Special notes for your reviewers**: See merge request: Ascend/op-plugin!35816 个月前
!2630 reshapeandcache support nz and dtype int8 Merge pull request !2630 from 史佳馨/master 11 个月前
create pta of gmm_swiglu_v2. Co-authored-by: liuyan121<liuyan171@huawei.com> # message auto-generated for no-merge-commit merge: !3369 merge master into master create pta of gmm_swiglu_v2. Created-by: liuyan121 Commit-by: liuyan121 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!33697 个月前
!2243 npu_kronecker_quant添加可选参数clip_ratio Merge pull request !2243 from 陈海杰/master 1 年前
新增算子LI\QLI\SFA\QSFA Co-authored-by: zzzyh22<zhangyaohui22@huawei.com> # message auto-generated for no-merge-commit merge: !3521 merge 7.3.0 into 7.3.0 新增算子LI\QLI\SFA\QSFA Created-by: zzzyh22 Commit-by: zzzyh22 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > /kind bug > /kind task > >kind feature **What does this PR do / why do we need it**: 根据需求新增LI\QLI\SFA\QSFA四个算子,其中 SFA算子通过“只计算关键部分”大幅减少计算量,优化算子性能。 QSFA在SFA的基础上支持了Per-Token-Head-Tile-128量化输入。 LI算子基于一系列操作得到每一个 token 对应的 Top-k个位置。 LIQ在LI的基础上支持了Per-Token-Head量化输入。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!35216 个月前
add kv_rms_norm_rope_cache v2 Co-authored-by: 15084323657<xulin91@huawei.com> # message auto-generated for no-merge-commit merge: !3528 merge KvRmsNormRopeCacheV2 into 7.3.0 add kv_rms_norm_rope_cache v2 Created-by: Alen2434 Commit-by: 15084323657 Merged-by: ascend-robot Description: 算子新增v2功能实现: kv_rms_norm_rope_cache_支持kv分离(新增可选输入tensor v)和ROPE(RMS(Dk))计算 tiling,kernel侧,增加非对称量化和Norm下量化功能。 实现输入kv N维度扩展 <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Special notes for your reviewers**: See merge request: Ascend/op-plugin!35286 个月前
!1304 Bugfix: fix the problem of optional parameter in npu_layer_norm_eval Merge pull request !1304 from 王广斌/master_cherry 2 年前
新增算子LI\QLI\SFA\QSFA Co-authored-by: zzzyh22<zhangyaohui22@huawei.com> # message auto-generated for no-merge-commit merge: !3521 merge 7.3.0 into 7.3.0 新增算子LI\QLI\SFA\QSFA Created-by: zzzyh22 Commit-by: zzzyh22 Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > /kind bug > /kind task > >kind feature **What does this PR do / why do we need it**: 根据需求新增LI\QLI\SFA\QSFA四个算子,其中 SFA算子通过“只计算关键部分”大幅减少计算量,优化算子性能。 QSFA在SFA的基础上支持了Per-Token-Head-Tile-128量化输入。 LI算子基于一系列操作得到每一个 token 对应的 Top-k个位置。 LIQ在LI的基础上支持了Per-Token-Head量化输入。 **Special notes for your reviewers**: See merge request: Ascend/op-plugin!35216 个月前
!1433 add moe compute tokens PTA Merge pull request !1433 from 冯佳伟/test 2 年前