| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
perf: bench ffn w/ weight transfer (#91) | 10 个月前 | |
perf: use ggml operators to optimize cpu ffn forwarding (#94) * perf: use ggml operators to optimize cpu ffn forwarding * perf: supports bf16 on ggml backend * chore: make clippy happy * chore: align the types * chore: tuning & fix serialization * fix: fix padding and context size * feat: allow dropping cache after loading expert backend * chore: statically link ggml * feat: allocating tensor data from rust side * feat: allow specifying computation backend * chore: clippy & format * chore: tuning * chore: delete unused feature flags * chore: remove ggml-cuda * fix: ggml-cpu.h includes ggml.h * perf: single thread for better throughput | 10 个月前 | |
perf: bench ffn using Criterion (#80) * perf: bench ffn using Criterion * chore: clippy & format * better ffn benchmarking * chore: explain why we use cuda to initialize weights | 10 个月前 | |
perf: remove useless locks & replace 3rd OnceCell with std OnceLock (#85) * perf: remove useless locks & replace 3rd OnceCell with std OnceLock * chore: clippy & format | 10 个月前 | |
perf: use ggml operators to optimize cpu ffn forwarding (#94) * perf: use ggml operators to optimize cpu ffn forwarding * perf: supports bf16 on ggml backend * chore: make clippy happy * chore: align the types * chore: tuning & fix serialization * fix: fix padding and context size * feat: allow dropping cache after loading expert backend * chore: statically link ggml * feat: allocating tensor data from rust side * feat: allow specifying computation backend * chore: clippy & format * chore: tuning * chore: delete unused feature flags * chore: remove ggml-cuda * fix: ggml-cpu.h includes ggml.h * perf: single thread for better throughput | 10 个月前 | |
perf: bench ffn using Criterion (#80) * perf: bench ffn using Criterion * chore: clippy & format * better ffn benchmarking * chore: explain why we use cuda to initialize weights | 10 个月前 |
| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
| 10 个月前 | ||
| 10 个月前 | ||
| 10 个月前 | ||
| 10 个月前 | ||
| 10 个月前 | ||
| 10 个月前 |