expert-kit/ek-benchmark · openEuler/expert-kit - AtomGit

bb431d95 created on August 19, 2025

File	Last commit	Last updated
scripts	feat: add onnxruntime backend and benchmark scripts	1 year ago
src	perf: use ggml operators to optimize cpu ffn forwarding (#94) * perf: use ggml operators to optimize cpu ffn forwarding * perf: supports bf16 on ggml backend * chore: make clippy happy * chore: align the types * chore: tuning & fix serialization * fix: fix padding and context size * feat: allow dropping cache after loading expert backend * chore: statically link ggml * feat: allocating tensor data from rust side * feat: allow specifying computation backend * chore: clippy & format * chore: tuning * chore: delete unused feature flags * chore: remove ggml-cuda * fix: ggml-cpu.h includes ggml.h * perf: single thread for better throughput	10 months ago
Cargo.toml	chore: update dependencies & bind ort version (#82)	10 months ago