| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
refactor(quantized_tensor): replace QuantizedTensorView with tensor tuples Co-authored-by: Muu<koimuu@163.com> | 13 天前 | |
refactor(quantized_tensor): replace QuantizedTensorView with tensor tuples Co-authored-by: Muu<koimuu@163.com> | 13 天前 | |
refactor(fp8): fill fp8 vacancy Co-authored-by: Muu<koimuu@163.com> | 2 个月前 | |
feat: enhance fp8_blockwise Co-authored-by: junhang<wangjunhang7@huawei.com> # message auto-generated for no-merge-commit merge: !88 merge feat_fp_blockwise into main feat(fp8_blockwise): support fp8_blockwise_grouped_quant Created-by: goodflower9 Commit-by: junhang Merged-by: ascend-robot Description: feat: enhance fp8_blockwise 测试结果见 https://wiki.huawei.com/domains/177655/wiki/373156/WIKI2026061211450503 See merge request: Ascend/TransformerEngineNPU!88 | 12 天前 | |
refactor(quantized_tensor): replace QuantizedTensorView with tensor tuples Co-authored-by: Muu<koimuu@163.com> | 13 天前 | |
refactor(quantized_tensor): replace QuantizedTensorView with tensor tuples Co-authored-by: Muu<koimuu@163.com> | 13 天前 | |
refactor(quantized_tensor): replace QuantizedTensorView with tensor tuples Co-authored-by: Muu<koimuu@163.com> | 13 天前 | |
refactor(quantized_tensor): replace QuantizedTensorView with tensor tuples Co-authored-by: Muu<koimuu@163.com> | 13 天前 | |
refactor(grouped_linear,gemm,fp8): overhaul grouped matmul with unified dispatch, NPU version check, and FP8 quantization cleanup Co-authored-by: Muu<koimuu@163.com> | 18 天前 |
| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
| 13 天前 | ||
| 13 天前 | ||
| 2 个月前 | ||
| 12 天前 | ||
| 13 天前 | ||
| 13 天前 | ||
| 13 天前 | ||
| 13 天前 | ||
| 18 天前 |