File | Last commit | Last updated
[Feature] Update custom op build framework (#8146)

## Summary
- update the custom-op build and packaging framework
- align current custom-op integration under csrc while keeping the current main-branch operator implementations
- improve custom-op runtime environment bootstrap so single-op tests and offline service no longer depend on manually sourcing vendor env scripts
- keep the non-custom-op build path unchanged

## Notes
- this PR focuses on the custom-op build framework update and related runtime loading path changes
- operator implementations continue to follow the current main-branch codebase
- final validation for this PR state relies on the CI results on GitHub

## Validation
- remote A2 targeted verification completed for custom-op build/install and runtime bootstrap
- full CI is used as the final validation gate for this PR state

- vLLM version:
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.19.0

Signed-off-by: maoxx241 <maomaoyu870@gmail.com>

18 days ago
[Kernel] add custom op GmmSwigluQuantWeightNzTensorList (#3804)

### What this PR does / why we need it?
This PR introduces support for adding custom CANN aclnn ops to vllm-ascend, allowing users to define and use their own custom operators.

Key changes include:
- Building and installing custom ops into the vllm-ascend-specified directory
- Binding the aclnn op interface to the torch.ops._C_ascend module
- Enabling invocation of these ops within vllm-ascend

This PR includes a sample custom op: aclnnGroupedMatmulSwigluQuantWeightNzTensorList, which is adapted from the CANN operator [aclnnGroupedMatmulSwigluQuantWeightNZ](https://www.hiascend.com/document/detail/zh/canncommercial/83RC1/API/aolapi/context/aclnnGroupedMatmulSwigluQuantWeightNZ.md). Its input parameters weight and weight_scale now accept list[torch.Tensor] (i.e., at::TensorList).

### Does this PR introduce _any_ user-facing change?
No.

- vLLM version: v0.11.2

---------

Signed-off-by: QianChenxi <chenxi.qian.cq@outlook.com>

5 months ago