| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
feat: upgrade management and routing improvements Major features: - Add elastic management for frequency tracking, load tracking, and routing broadcasts. - Add routing service APIs and expertkit-transport-rs with gRPC, shared-memory, RDMA, and load-balancing strategies. - Integrate worker-side weight management, peer weight transfer, byte-indexed weight serving. - Expand model/runtime support for DeepSeek v2/v3, Mixtral, Qwen3-MoE, vLLM integration, and Aliyun/local validation configs. Important fixes: - Fix fallback path stalls, worker disconnect fallback, torch backend device-index recognition, and DeepSeek-v2-lite runnable support. - Improve recovery by batching recovery upserts, deduplicating recovery triggers, keeping routing stable. - Align controller/worker liveness checks, routing subscriptions. | 10 天前 | |
feat: rust version of create-model. update model config to use weight-server address | 1 年前 | |
dev: implement core of state io trait | 1 年前 | |
perf: remove useless locks & replace 3rd OnceCell with std OnceLock (#85) * perf: remove useless locks & replace 3rd OnceCell with std OnceLock * chore: clippy & format | 10 个月前 | |
feat: upgrade management and routing improvements Major features: - Add elastic management for frequency tracking, load tracking, and routing broadcasts. - Add routing service APIs and expertkit-transport-rs with gRPC, shared-memory, RDMA, and load-balancing strategies. - Integrate worker-side weight management, peer weight transfer, byte-indexed weight serving. - Expand model/runtime support for DeepSeek v2/v3, Mixtral, Qwen3-MoE, vLLM integration, and Aliyun/local validation configs. Important fixes: - Fix fallback path stalls, worker disconnect fallback, torch backend device-index recognition, and DeepSeek-v2-lite runnable support. - Improve recovery by batching recovery upserts, deduplicating recovery triggers, keeping routing stable. - Align controller/worker liveness checks, routing subscriptions. | 10 天前 |