cann-recipes-infer / executor / core
Latest commit: 48a59dc0 by cann-robot, created 2 days ago
【feat】feat multi block_size in cache management
File               Last commit message                                        Last updated
config             qwen3_5 support fp8 and mxfp8 quantization                 5 days ago
engine             【feat】feat multi block_size in cache management            2 days ago
forward_data_info  【feat】feat multi block_size in cache management            2 days ago
kv_cache           【feat】feat multi block_size in cache management            2 days ago
model_worker       【feat】feat multi block_size in cache management            2 days ago
scheduler          feat: support qwen3.5                                      22 days ago
__init__.py        [docs] add framwork checklist                              25 days ago
support_models.py  [refactor] gemma-4: paged attention + TND + MC2 EP decode  5 days ago