| feat: upgrade management and routing improvements. Major features: - Add elastic management for frequency tracking, load tracking, and routing broadcasts. - Add routing service APIs and expertkit-transport-rs with gRPC, shared-memory, RDMA, and load-balancing strategies. - Integrate worker-side weight management, peer weight transfer, and byte-indexed weight serving. - Expand model/runtime support for DeepSeek v2/v3, Mixtral, Qwen3-MoE, vLLM integration, and Aliyun/local validation configs. Important fixes: - Fix fallback-path stalls, worker-disconnect fallback, and torch backend device-index recognition, and make DeepSeek-v2-lite runnable. - Improve recovery by batching recovery upserts, deduplicating recovery triggers, and keeping routing stable. - Align controller/worker liveness checks and routing subscriptions. | 8 days ago |