文件最后提交记录最后更新时间
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Support boxed variants of chat and completion models (#329) * Support boxed variants of chat and completion models * Document boxed chat methods1 年前
Support boxed variants of chat and completion models (#329) * Support boxed variants of chat and completion models * Document boxed chat methods1 年前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Fix clippy lints (#381) 11 个月前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Generalize task chunker and add more logging (#423) * generalize task chunker and add more logging * Fix misleading variable name in chunking example (#429) * Initial plan * Fix variable name: rename hypothetical to summarizer in chunking.rs example Co-authored-by: ealmloff <66571940+ealmloff@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ealmloff <66571940+ealmloff@users.noreply.github.com> * fix formatting --------- Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>2 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
Fix clippy lints (#381) 11 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
Fix clippy lints (#381) 11 个月前
Remove anyhow (#313) * Remove anyhow from llama source and cache Partial fix for: https://github.com/floneum/floneum/issues/186 * remove anyhow where possible in kalosm-* libraries * Update examples with breaking changes * fix formatting * Fix clippy * Fix floneum build --------- Co-authored-by: Josh McKinney <joshka@users.noreply.github.com>1 年前
Use a chat model in the custom llama example 1 年前
Fix clippy lints (#381) 11 个月前
Add support for Gemma3 and codestral (#378) * add gemma 3 support * fix gemma example * add a warning about system prompts * fix codestral chat template * fix md_get * remove extra file * fix formatting * remove commented out code * fix kalosm llama example1 年前
Remove anyhow (#313) * Remove anyhow from llama source and cache Partial fix for: https://github.com/floneum/floneum/issues/186 * remove anyhow where possible in kalosm-* libraries * Update examples with breaking changes * fix formatting * Fix clippy * Fix floneum build --------- Co-authored-by: Josh McKinney <joshka@users.noreply.github.com>1 年前
Fix clippy lints (#381) 11 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
feat(SurrealDB Integration): Hybrid Search Support (#415) * Added hybrid search setup * Refactored for more testable code * Added unit test for score normalization function * Added unit tests for distance to similarity calculation * Added tests for weighted score calculation * Added tests for rrf * Added tests for results merging * Finished units, with rrf and weighted pipeline test * Refactored for builder pattern * Completed hybrid search api * Created integration tests * Updated surrealdb version and queries * Added conditional materialized fild * Fixed query definitions for both semantic and keyword * Fixed normalization issues with weighted search * Fixed ranking issues in rrf * Finalized queries for materialized fields and index rebuilding * Created default value for rrf * Completed integration tests * Simplified hybrid search with default field * Removed unused code and added comments * Making sure search parameters are clamped and adjusted * Completed example * tighten test assertions, make more functions private to avoid semvar breaking changes in the future and clean up the error type * Returning original example back * Clippy fixes * fixed doc tests * fixed fmt checks --------- Co-authored-by: Evan Almloff <evanalmloff@gmail.com>4 个月前
reorganize packages 2 年前
Make kalosm-sound usable in wasm (#407) * make kalosm-sound usable in wasm * smaller default tiny en model * fix cargo check * forward input features * fix examples6 个月前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Fix clippy lints (#381) 11 个月前
Generalize task chunker and add more logging (#423) * generalize task chunker and add more logging * Fix misleading variable name in chunking example (#429) * Initial plan * Fix variable name: rename hypothetical to summarizer in chunking.rs example Co-authored-by: ealmloff <66571940+ealmloff@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ealmloff <66571940+ealmloff@users.noreply.github.com> * fix formatting --------- Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>2 个月前
reorganize packages 2 年前
Fix clippy lints (#381) 11 个月前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
add phi 4 support (#323) 1 年前
Fix clippy lints (#381) 11 个月前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Fix clippy lints (#381) 11 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
Remove anyhow (#313) * Remove anyhow from llama source and cache Partial fix for: https://github.com/floneum/floneum/issues/186 * remove anyhow where possible in kalosm-* libraries * Update examples with breaking changes * fix formatting * Fix clippy * Fix floneum build --------- Co-authored-by: Josh McKinney <joshka@users.noreply.github.com>1 年前
Remote chat, remote structured generation models, and single file gguf chat model loading (#319) * add chat template support and remove the VectorSpace trait * move sampling and chat templates to kalosm llama * update kalosm-llama unstructured generation to the new interface * restore structured generation module * Restore llama implementation of structured generation * clean up kalosm-llama clippy lints * restore llama chat and structured chat implementation * improve infer chat example * add support for remote chat models * support constraints for openai remote models * load the tokenizer from the gguf file if a huggingface tokenizer is not present * Fix tokenizer conversion * restore chat struct * Fix chat implementation with llama * remove tokio from language model * Create chat and text completion extension traits * add task helper to the chat extension trait * update kalosm-language to new task interface * make llama callable * add with_constraints method to task * fix task example * update examples to new chat and task api * set tools to none to fix llama chat template * Add helpers for the default parser for a specific type and model combo * simplify constrained rust type example * restore prompt annealing * fix structured example * document text completion model * document new chat api * update task documentation * Fix tokenizer gguf * fix custom llama source example * fix remaining tests * add logging to remote examples * Clippy fixes * More clippy fixes * use function call in docs more constantly * fix remaining doc tests1 年前
Add a separate method for embedding queries 1 年前
Fix clippy lints (#381) 11 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前
Remove anyhow (#313) * Remove anyhow from llama source and cache Partial fix for: https://github.com/floneum/floneum/issues/186 * remove anyhow where possible in kalosm-* libraries * Update examples with breaking changes * fix formatting * Fix clippy * Fix floneum build --------- Co-authored-by: Josh McKinney <joshka@users.noreply.github.com>1 年前
Add support for Qwen 2.5 Vision (#382) * implement qwen vision embed and patch merger * implement qwen vision block * calculate the rope index of images and videos * add get_window_index * fix get window index * unwrap less * Create media source api * integrate the new media support into the language model trait * Create QwenVisionTransformer * implement QwenVisionTransformer::forward * fix formatting * fix loading qwen 2.5 vl * fix rot_pos_emb * add image preprocessing utilities * fix vision rope * fix mask * Fix feed forward * qwen vision forward working * unwrap less * clean up * create tensor tools cli * fix cli * fix fuse tokenizer * move parse into its own module * Use llama.cpp compatible tensor names * add preset * load qwen vision metadata from the gguf file * fix loading the vision encoder * test process image * forward eps and add more tests * fix image processing * implement image chat templating * full pipeline running * fix formatting * use 3d rope index * fix dimension_sections decoding * qwen vl rope working * remove logs * fix rope tests * fix rope size * fix rope index to tensor conversion * Fix rope updates * normalize image input * match image resize behavior * fix fullatt_block calculation * vision model works * remove logs * add more qwen vl presets * fix some clippy lints * fix clippy * Fix ToChatMessage * expose image processing hints * remove unwraps * fix unwraps in tests * fix more examples11 个月前