Star679
378
代码介绍
代码
Issues26
Pull Requests43
流水线
Actions
讨论
Wiki
项目成员245
分析
项目设置
Star679
378
  1. cann-recipes-infer
  2. /
  3. executor
cann-robotcann-robot[feat]support mxfp8 inference of GLM-5 on 950 platform
7c4fb613创建于 1 天前历史提交
文件最后提交记录最后更新时间
core
【feat】feat multi block_size in cache management1 天前
model_loader
support deepseek v41 个月前
offline
[fix]update: 重构后,编译缓存位置及目录调整;qwen_moe配置参数纠正15 天前
online
【feat】feat multi block_size in cache management1 天前
scripts
[fix] Resolved the issue that the torchrun command does not exist when the inference is started on the one-stop platform.14 天前
utils
[feat]support mxfp8 inference of GLM-5 on 950 platform1 天前
model_runner.py
refactor: 支持online多batch推理1 个月前