kalosm/interfaces/language-model/src/model (branch: main)
Latest commit (GitHub): Feature: Add Anthropic Structured Outputs (#428), 34882bc1, created on March 2
File
Last commit
Last updated
boxed.rs
Bump wgpu and fix wasm support for llama (#416) * bump wgpu and fix wasm support for llama * use git version of wgpu * fix checks and require tokio for ocr * fix formatting * fix the quantized test * fix large dispatches * fix doctests * fix clippy * match wgpu version in ci and cache windows * fix formatting * setup vulkan * set vulkan version * cargo update * failing tests on all platforms * fix f16 tensors on gpus that don't support f16 * restore f16 detection * more resilient caching * pull out lock current logic * use dxc * use std file locks * try using a smaller batch limit * install warp
4 months ago
ext.rs
Floneum cpu (#424) * fix the summary chunker * start * wip * some progress * abox * start simding * add some operations * pull out ResolveTensor * refactor * more refactoring * fill in missing ops * remove ResolvedTensorMut * add benchmarks * faster * matmul op * reduce operations * refactor into multiple files * fuse ops * more ops * clean up comments * remove unsafe * remove more unsafe * initial fusor crate * refactoring fusor cpu * tensor type * move more ops to tensor * more refactoring * more refactoring * clean up some unused code * remove most of fusor * slice assign op * quantized cpu * optimize qmatmul * wip * new types crate * use layout type in cpu * pull out rank * as_slice for gpuor * better add impl * partially borrowed ops * refactor pairwise * more gpuor ops * more ops * batched matmul * implement some composite ops * reduce ops * normalization ops * reshape ops * move most of the logic onto layout itself * move sliding window into the layout type * a bunch more composite ops * more methods * more ops * rename qmatmul * batched cpu qmatmul * quantized support for gpuor * rename to be closer to fusor-core * start porting rwhisper to cpu * wip whisper * optimize a bit * more optimization * more optimization * rename eval * more lazy * remove some logs * move all matches to dispatch * more dispatch * fix add_ and other ops * use layout instead * get rid of ResolveTensor * remove useless Expr methods * fix fusor * shape on cpu tensor * as ref tensor * remove Expr trait * remove expr from fusor * owned and copy cpu tensors * fix fusor/fusor * start migrating fusor * fix fusor * rwhisper compiles * fix reshape * fix cpu map layout * remove debug * remove submodule * simd qmatmul * nightly optimizations * switch llama to fusor * remove const generic from QMatrix * don't use rayon for parallelization * MapLayout is currently a concrete tensor * simd gather * less matching in slow softmax * wip * remove SimdComparisonOp * fix fusor * lazy maplayout * fix maplayout * rwhisper compiles * fix rwhisper * wip * llama running! * fix vision on cpu * wip q5k * start gpu impl * fix out of memory * q5k_sgemv * fix formatting * fix some warnings * remove some dead code * refactor from array * Use rustversion to conditionally enable nightly NEON intrinsics The vdotq_s32 intrinsic requires the nightly-only stdarch_neon_dotprod feature. This change uses rustversion to detect nightly builds and only enables the optimized NEON code path when building with nightly Rust. On stable Rust, the scalar fallback implementation is used instead. * fix formatting * fix nightly formatting * import SimdElement trait for x86 gather operations * fix unsafe blocks for Rust 2024 edition * fix clippy warnings in test modules * fix tensor shape handling in conv1d and layer_norm loading * skip qwen download in CI * fix gelu in warp
3 months ago
generation_parameters.rs
Feature: Add Anthropic Structured Outputs (#428) * refactor * add e2e tests for anthropic structured gen and more models * pull out shared logic * fix clippy --------- Co-authored-by: Evan Almloff <evanalmloff@gmail.com>
2 months ago
mod.rs
Feature: Add Anthropic Structured Outputs (#428) * refactor * add e2e tests for anthropic structured gen and more models * pull out shared logic * fix clippy --------- Co-authored-by: Evan Almloff <evanalmloff@gmail.com>
2 months ago