| [Performance] add op chunk_fwd_o and chunk_gated_delta_rule_fwd_h (#9018)
### What this PR does / why we need it?
add custom op for performance improve: chunk_fwd_o &
chunk_gated_delta_rule_fwd_h
- vLLM version: v0.19.1
- vLLM main:
https://github.com/vllm-project/vllm/commit/4d51588e2381018348f1022dfa3a7698899805b7
Signed-off-by: AlanisZomeg <1308342839@qq.com> | 17 天前 |