| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
feat: impl mock work server feat[ExpertKitMoE]: add drop duplicate pad token from forward hidden state feat[ExpertKitMoE]: finish test for expertkit-vllm feat: test client for mock server TODO: seens bugs in vllm mla (tensor not in same device) | 1 年前 | |
feat[integration]: impl plugin for vllm, need for test | 1 年前 | |
feat: vLLM plugin support (#53) * feat: add config for ek-vllm plugin feat: integrate vLLM with ek framework and fix token output issues fix: add missing shared expert output summation in DeepSeek MoE forward pass * typo: unify env variables from EXPERTKIT to EK fix: correct vllm output * doc: Update vLLM plugin docs for latest configuration method * fix: Change the example model path from a local directory path to a Hugging Face model name. | 1 年前 |
| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
| 1 年前 | ||
| 1 年前 | ||
| 1 年前 |