| add kirin9030 ops
Co-authored-by: zengjuan <zengjuan2@huawei.com>
# message auto-generated for no-merge-commit merge:
!1118 merge master into master
add kirin9030 ops
Created-by: zengjuan
Commit-by: zengjuan
Merged-by: cann-robot
Description: ## Description
add kirin9030 ops:
attention/nsa_compress_with_cache
attention/nsa_selected_attention_infer
attention/ring_attention_update
ffn/ffn
ffn/swin_attention_ffn
ffn/swin_transformer_ln_qkv
ffn/swin_transformer_ln_qkv_quant
gmm/grouped_matmul
gmm/grouped_matmul_swiglu_quant
moe/moe_compute_expert_tokens
moe/moe_finalize_routing
moe/moe_finalize_routing_v2
moe/moe_re_routing
moe/moe_token_unpermute
moe/moe_token_unpermute_with_ep
moe/moe_token_unpermute_with_routing_map
posembedding/apply_rotary_pos_emb
posembedding/dequant_rope_quant_kvcache
posembedding/interleave_rope
posembedding/kv_rms_norm_rope_cache
posembedding/rope_quant_kvcache
posembedding/rope_with_sin_cos_cache
posembedding/rotary_position_embedding
## Related Issue
https://gitcode.com/cann/ops-transformer/issues/671
## Testing
Blue-zone gate checks, yellow-zone gate checks, and operator level-2 smoke tests
## Documentation Updates
None
## Type Labels
<!-- [x] means selected -->
- [ ] Bug fix
- [x] New feature
- [ ] Performance optimization
- [ ] Documentation update
- [ ] Other, please describe:
See merge request: cann/ops-transformer!1118 | 3 months ago |