| [BugFix][310p] Fixing the aclgraph error caused by blocktable (#8948)
### What this PR does / why we need it?
This PR fixes an ACL Graph error on Ascend 310P devices by moving the
block table's slot mapping computation to the CPU. On 310P, certain
device-side arithmetic operations used in the default slot mapping
computation are unsupported or cause errors during graph execution.
Key changes:
- Overrode BlockTable for 310P to use NumPy for slot mapping
computation.
- Updated NPUModelRunner to perform this computation on the CPU early
in the input preparation phase.
- Avoided unsupported device-side additions for positions and
seq_lens on 310P by using CPU buffers.
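
As a rough illustration of the CPU-side slot mapping described in the key changes, the sketch below computes slot indices with NumPy. It is not the code from this PR: the function name `compute_slot_mapping_cpu` and its parameters are hypothetical, and it assumes the usual paged-KV-cache layout in which a token's slot is `block_number * block_size + offset_in_block`.

```python
import numpy as np


def compute_slot_mapping_cpu(block_table, req_indices, positions, block_size):
    """Hypothetical helper: map each token to its KV-cache slot on the CPU.

    block_table: 2-D array, block_table[r, i] is the physical block number
                 backing logical block i of request r.
    req_indices: 1-D array, the request row each token belongs to.
    positions:   1-D array, the token's position within its request.
    block_size:  number of slots per block.
    """
    block_table = np.asarray(block_table, dtype=np.int64)
    req_indices = np.asarray(req_indices, dtype=np.int64)
    positions = np.asarray(positions, dtype=np.int64)

    # Logical block index and offset inside that block, per token.
    logical_block = positions // block_size
    offset = positions % block_size

    # Look up the physical block number for every token.
    physical_block = block_table[req_indices, logical_block]

    # All arithmetic stays in NumPy, so no device-side add/mul is needed.
    return physical_block * block_size + offset


if __name__ == "__main__":
    table = [[5, 2], [7, 0]]
    slots = compute_slot_mapping_cpu(table, [0, 0, 1], [3, 4, 1], block_size=4)
    print(slots.tolist())
```

In a setup like this, the resulting array would be copied to the device once per step as a plain buffer, so the captured graph would not need to contain the index arithmetic itself.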
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
Verified on Ascend 310P hardware with vLLM v0.19.1.
- vLLM version: v0.19.1
- vLLM main:
https://github.com/vllm-project/vllm/commit/d886c26d4d4fef7d079696beb4ece1cfb4b008a8
---------
Signed-off-by: Tflowers-0129 <2906339855@qq.com> | 9 days ago |