文件最后提交记录最后更新时间
feat: conv1d wrapper Co-authored-by: liuxi_<liuxi75@huawei.com> 25 天前
feat: add RmsNormGated Co-authored-by: feng0w0<houyufeng4@huawei.com> # message auto-generated for no-merge-commit merge: !22 merge master into master feat: add RmsNormGated Created-by: feng0w0 Commit-by: feng0w0 Merged-by: ascend-robot Description: ## What this PR does / why we need it? add RmsNormGated Triton [#1](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? ut ![image.png](https://raw.gitcode.com/user-images/assets/9612429/12418371-1f2d-4966-ab96-986077b3c558/image.png 'image.png') atk 精度 ![image.png](https://raw.gitcode.com/user-images/assets/9612429/3879ca4d-ab45-42c2-ab46-fa2920736536/image.png 'image.png') atk性能 ![image.png](https://raw.gitcode.com/user-images/assets/9612429/5efab269-aefb-4d02-8be5-fe36df89b857/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!229 天前
feat: add chunk_gated_delta_rule_fwd_h operator Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !28 merge master_fwd_h into master feat: add chunk_gated_delta_rule_fwd_h operator Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: ## What this PR does / why we need it? add chunk_gated_delta_rule_fwd_h operator [#related roadmap](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) ## Does this PR introduce any user-facing change? Reference Operator Markdown Description. ## How was this patch tested? UT and ATK ![image.png](https://raw.gitcode.com/user-images/assets/9612429/efc6d84c-5b47-46e5-8b4f-e47cffe169fe/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/0c0e8daa-1066-4034-9dfd-565b7e182c11/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!288 天前
[CI][Fix] Temporarily Skip chunk_kda_bwd precision UT Co-authored-by: zhuweichen<calvin_zhu0210@outlook.com> # message auto-generated for no-merge-commit merge: !36 merge ut_fix into master [CI][Fix] Temporarily Skip chunk_kda_bwd precision UT Created-by: zhuweichen Commit-by: zhuweichen Merged-by: ascend-robot Description: ## What this PR does / why we need it? Temporarily Skip chunk_kda_bwd precision UT ## Does this PR introduce any user-facing change? No ## How was this patch tested? Irrelevant See merge request: Ascend/MindSpeed-Ops!3612 天前
feat: Add FusedCrossEntropyLoss for Qwen3.5 Co-authored-by: liu_zhi_xu<liuzhexu1@huawei.com> # message auto-generated for no-merge-commit merge: !25 merge new_func into master feat: Add FusedCrossEntropyLoss for Qwen3.5 Created-by: liu_zhi_xu Commit-by: liu_zhi_xu Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1、New model adaptation operator completion [#related roadmap](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) 2、Modify ATK config related RMS/SINK ## Does this PR introduce any user-facing change? Reference Operator Markdown Description ## How was this patch tested? [UT] pytest test_fused_cross_entropy_loss.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/2afb8d87-6bf6-4113-8078-1ea9eb30aebc/image.png 'image.png') [ATK] atk case -f rmsnorm_without_weight.yaml -p generate_rmsnorm_without_weight.py atk node --backend triton --devices 0 node --backend cpu --devices 0 task -c result/rmsnorm_without_weight/json/all_rmsnorm_without_weight.json --task accuracy -p triton_rmsnorm_without_weight.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/2440a7b5-0522-4e2e-a995-26623dc16d46/image.png 'image.png') atk case -f sinkhorn.yaml -p generate_sinkhorn.py atk node --backend triton --devices 0 node --backend cpu --devices 0 task -c result/sinkhorn/json/all_sinkhorn.json --task accuracy -p triton_sinkhorn.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/6bba5494-02a3-427d-b7d4-30997cc8a91a/image.png 'image.png') atk case -f fused_cross_entropy_loss.yaml -p generate_fused_cross_entropy_loss.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/fused_cross_entropy_loss/json/all_fused_cross_entropy_loss.json --task accuracy -p triton_fused_cross_entropy_loss.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/fused_cross_entropy_loss/json/all_fused_cross_entropy_loss.json --task performance_device -p triton_fused_cross_entropy_loss.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/ba106c72-2104-4d55-82c4-87203544529d/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/d7951ef0-e096-430c-9030-3e690e64d807/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!2519 天前
feat: Add FusedCrossEntropyLoss for Qwen3.5 Co-authored-by: liu_zhi_xu<liuzhexu1@huawei.com> # message auto-generated for no-merge-commit merge: !25 merge new_func into master feat: Add FusedCrossEntropyLoss for Qwen3.5 Created-by: liu_zhi_xu Commit-by: liu_zhi_xu Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1、New model adaptation operator completion [#related roadmap](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) 2、Modify ATK config related RMS/SINK ## Does this PR introduce any user-facing change? Reference Operator Markdown Description ## How was this patch tested? [UT] pytest test_fused_cross_entropy_loss.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/2afb8d87-6bf6-4113-8078-1ea9eb30aebc/image.png 'image.png') [ATK] atk case -f rmsnorm_without_weight.yaml -p generate_rmsnorm_without_weight.py atk node --backend triton --devices 0 node --backend cpu --devices 0 task -c result/rmsnorm_without_weight/json/all_rmsnorm_without_weight.json --task accuracy -p triton_rmsnorm_without_weight.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/2440a7b5-0522-4e2e-a995-26623dc16d46/image.png 'image.png') atk case -f sinkhorn.yaml -p generate_sinkhorn.py atk node --backend triton --devices 0 node --backend cpu --devices 0 task -c result/sinkhorn/json/all_sinkhorn.json --task accuracy -p triton_sinkhorn.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/6bba5494-02a3-427d-b7d4-30997cc8a91a/image.png 'image.png') atk case -f fused_cross_entropy_loss.yaml -p generate_fused_cross_entropy_loss.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/fused_cross_entropy_loss/json/all_fused_cross_entropy_loss.json --task accuracy -p triton_fused_cross_entropy_loss.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/fused_cross_entropy_loss/json/all_fused_cross_entropy_loss.json --task performance_device -p triton_fused_cross_entropy_loss.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/ba106c72-2104-4d55-82c4-87203544529d/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/d7951ef0-e096-430c-9030-3e690e64d807/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!2519 天前
add mhc ops Co-authored-by: wangxuefei10<wangxuefei10@huawei.com> # message auto-generated for no-merge-commit merge: !35 merge dev_mhc_0525 into master feat: add mhc triton ops Created-by: Ling_i Commit-by: wangxuefei10 Merged-by: ascend-robot Description: ## What this PR does / why we need it? add mhc triton ops https://gitcode.com/Ascend/MindSpeed-Ops/issues/1 ## Does this PR introduce any user-facing change? Reference Operator Markdown Description. ## How was this patch tested? UT and UTK ![image.png](https://raw.gitcode.com/user-images/assets/9612429/f75f1ab2-4d23-4dc6-aacf-3d6e95d335b6/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!3510 天前
add mhc ops Co-authored-by: wangxuefei10<wangxuefei10@huawei.com> # message auto-generated for no-merge-commit merge: !35 merge dev_mhc_0525 into master feat: add mhc triton ops Created-by: Ling_i Commit-by: wangxuefei10 Merged-by: ascend-robot Description: ## What this PR does / why we need it? add mhc triton ops https://gitcode.com/Ascend/MindSpeed-Ops/issues/1 ## Does this PR introduce any user-facing change? Reference Operator Markdown Description. ## How was this patch tested? UT and UTK ![image.png](https://raw.gitcode.com/user-images/assets/9612429/f75f1ab2-4d23-4dc6-aacf-3d6e95d335b6/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!3510 天前
add mhc ops Co-authored-by: wangxuefei10<wangxuefei10@huawei.com> # message auto-generated for no-merge-commit merge: !35 merge dev_mhc_0525 into master feat: add mhc triton ops Created-by: Ling_i Commit-by: wangxuefei10 Merged-by: ascend-robot Description: ## What this PR does / why we need it? add mhc triton ops https://gitcode.com/Ascend/MindSpeed-Ops/issues/1 ## Does this PR introduce any user-facing change? Reference Operator Markdown Description. ## How was this patch tested? UT and UTK ![image.png](https://raw.gitcode.com/user-images/assets/9612429/f75f1ab2-4d23-4dc6-aacf-3d6e95d335b6/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!3510 天前
feat: add recompute_w_u_fwd triton operator Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !31 merge master_recompute_w_u_fwd into master feat: add recompute_w_u_fwd triton operator Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: ## What this PR does / why we need it? add recompute_w_u_fwd triton operator [#related roadmap](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) ## Does this PR introduce any user-facing change? Reference Operator Markdown Description. ## How was this patch tested? UT and ATK ![image.png](https://raw.gitcode.com/user-images/assets/9612429/38859bc3-71f3-4213-9443-c60617336ead/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/9b3013a5-7e99-4a28-8eb8-82cfb89c7767/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!318 天前
feat:Add Sinkhorn for DS V4 Co-authored-by: liu_zhi_xu<liuzhexu1@huawei.com> # message auto-generated for no-merge-commit merge: !24 merge sinkhorn into master feat:Add Sinkhorn for DS V4 Created-by: liu_zhi_xu Commit-by: liu_zhi_xu Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1、New model adaptation operator completion [#related roadmap](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) 2、Operator Description Enhancement for rmsnorm_without_weight 3、Refactor the common test func related Add/RMS/SINK ## Does this PR introduce any user-facing change? Reference Operator Markdown Description ## How was this patch tested? [UT] pytest test_sinkhorn.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/d595acb8-cc1d-4b98-b7bc-4c46d3a586a9/image.png 'image.png') [ATK] atk case -f sinkhorn.yaml -p generate_input.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/sinkhorn/json/all_sinkhorn.json --task performance_device -p sinkhorn.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/sinkhorn/json/all_sinkhorn.json --task accuracy -p sinkhorn.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/fd979bb5-ece5-4eac-adf9-5cca7a236279/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/b4809f5d-78a1-4673-9251-2915e6055f8b/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!2422 天前
feat:Add Sinkhorn for DS V4 Co-authored-by: liu_zhi_xu<liuzhexu1@huawei.com> # message auto-generated for no-merge-commit merge: !24 merge sinkhorn into master feat:Add Sinkhorn for DS V4 Created-by: liu_zhi_xu Commit-by: liu_zhi_xu Merged-by: ascend-robot Description: ## What this PR does / why we need it? 1、New model adaptation operator completion [#related roadmap](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) 2、Operator Description Enhancement for rmsnorm_without_weight 3、Refactor the common test func related Add/RMS/SINK ## Does this PR introduce any user-facing change? Reference Operator Markdown Description ## How was this patch tested? [UT] pytest test_sinkhorn.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/d595acb8-cc1d-4b98-b7bc-4c46d3a586a9/image.png 'image.png') [ATK] atk case -f sinkhorn.yaml -p generate_input.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/sinkhorn/json/all_sinkhorn.json --task performance_device -p sinkhorn.py atk node --backend triton --devices 0 node --backend npu --devices 0 task -c result/sinkhorn/json/all_sinkhorn.json --task accuracy -p sinkhorn.py ![image.png](https://raw.gitcode.com/user-images/assets/9612429/fd979bb5-ece5-4eac-adf9-5cca7a236279/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/b4809f5d-78a1-4673-9251-2915e6055f8b/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!2422 天前
feat: prepare_wy_repr ops Co-authored-by: liuxi_<liuxi75@huawei.com> # message auto-generated for no-merge-commit merge: !20 merge prepare_wy_repr into master feat: prepare_wy_repr ops Created-by: liuxi_ Commit-by: liuxi_ Merged-by: ascend-robot Description: ## What this PR does / why we need it? prepare_wy_repr ops [#1](https://gitcode.com/Ascend/MindSpeed-Ops/issues/1) ## Does this PR introduce any user-facing change? 不涉及 ## How was this patch tested? TA 3.2.1 ut ![image.png](https://raw.gitcode.com/user-images/assets/9612429/e080ebd9-8a25-45ae-ad87-dccda27e1e77/image.png 'image.png') atk ![image.png](https://raw.gitcode.com/user-images/assets/9612429/78ebd5d6-8a38-4316-8ee3-ff1a055f75eb/image.png 'image.png') ![image.png](https://raw.gitcode.com/user-images/assets/9612429/2c25d24b-773d-43c0-a714-4461a6cf51e9/image.png 'image.png') See merge request: Ascend/MindSpeed-Ops!2018 天前