File | Last commit | Last updated
[Kernel] add custom op GmmSwigluQuantWeightNzTensorList (#3804)

### What this PR does / why we need it?

This PR introduces support for adding custom CANN aclnn ops to vllm-ascend, allowing users to define and use their own custom operators. Key changes include:

- Building and installing custom ops into the vllm-ascend-specified directory
- Binding the aclnn op interface to the torch.ops._C_ascend module
- Enabling invocation of these ops within vllm-ascend

This PR includes a sample custom op: aclnnGroupedMatmulSwigluQuantWeightNzTensorList, which is adapted from the CANN operator [aclnnGroupedMatmulSwigluQuantWeightNZ](https://www.hiascend.com/document/detail/zh/canncommercial/83RC1/API/aolapi/context/aclnnGroupedMatmulSwigluQuantWeightNZ.md). Its input parameters weight and weight_scale now accept list[torch.Tensor] (i.e., at::TensorList).

### Does this PR introduce _any_ user-facing change?

No.

- vLLM version: v0.11.2

---------

Signed-off-by: QianChenxi <chenxi.qian.cq@outlook.com>

5 months ago
[CI][lint] Add rule codespell back (#6236)

### What this PR does / why we need it?

After codespell had been removed for a while, we discovered that the typo checker had trouble correctly recognizing certain misspelled words, so I suggested adding codespell back.

- vLLM version: v0.14.1
- vLLM main: https://github.com/vllm-project/vllm/commit/d68209402ddab3f54a09bc1f4de9a9495a283b60

---------

Signed-off-by: wangli <wangli858794774@gmail.com>

3 months ago