cann-recipes-infer / executor / utils
Latest commit: cann-robot, [feat]support mxfp8 inference of GLM-5 on 950 platform (7c4fb613, created 2 days ago)
File                  Last commit                                                 Last updated
__init__.py           [Refactor] comm_manager refactor                            23 days ago
common_utils.py       [feat] mooncake + hixl, import, support models lazy import  25 days ago
data_utils.py         [refactor] mc2 use independent communication group          1 month ago
forward_metadata.py   [refactor] add page attention cache management              1 month ago
graph_utils.py        [refactor] MTP refactor and R1 adaptation framework         2 months ago
hccl_utils.py         [feat]support mxfp8 inference of GLM-5 on 950 platform      2 days ago
logging_config.py     refactor: support online multi-batch inference              1 month ago
profiler_context.py   [feat]prefill profiling                                     2 months ago
stream_utils.py       [feat]support mxfp8 inference of GLM-5 on 950 platform      2 days ago