Star680
378
代码介绍
代码
Issues20
Pull Requests42
流水线
Actions
讨论
Wiki
项目成员245
分析
项目设置
Star680
378
  1. cann-recipes-infer
  2. /
  3. module
cann-robotcann-robotHunyuanVideo sparse attention svg算法 sample_mse性能优化
3cf067a4创建于 2 天前历史提交
文件最后提交记录最后更新时间
blockwise_sparse
HunyuanVideo sparse attention svg算法 sample_mse性能优化2 天前
dit_cache
[refactor] Unified inference script for multimodal models1 个月前
fa_quant
[feat] HunyuanVideo support fa/matmul quant and uaa2 个月前
quantization
[feat]swiglu group quant3 天前
unified_sp
[feat] HunyuanVideo support fa/matmul quant and uaa2 个月前
fuse_moe_gmm.py
[feat] step3p7_flash: 8 卡 / 16 rank NPU 推理适配 + 端到端优化(Decode ~5.6×)+ 图像输入7 天前
linear.py
fix: shard VocabParallelEmbedding weight at init23 天前
utils.py
【feat】add DeepSeek-V3.2-Exp model8 个月前
vae_patch_parallel.py
[feat] support hunyuanImage-3.0 model inference5 个月前