| HunyuanVideo sparse attention svg算法 sample_mse性能优化 | 2 天前 |
| [refactor] Unified inference script for multimodal models | 1 个月前 |
| [feat] HunyuanVideo support fa/matmul quant and uaa | 2 个月前 |
| [feat]swiglu group quant | 3 天前 |
| [feat] HunyuanVideo support fa/matmul quant and uaa | 2 个月前 |
| [feat] step3p7_flash: 8 卡 / 16 rank NPU 推理适配 + 端到端优化(Decode ~5.6×)+ 图像输入 | 7 天前 |
| fix: shard VocabParallelEmbedding weight at init | 23 天前 |
| 【feat】add DeepSeek-V3.2-Exp model | 8 个月前 |
| [feat] support hunyuanImage-3.0 model inference | 5 个月前 |