文件最后提交记录最后更新时间
CANN: Disable acl_graph for prefill stage (#15933) Since the prefill length is not fixed, graphs constructed for the prefill stage cannot be reused. For this reason, ACL graph execution is disabled by default during prefill.8 个月前
docs : update HOWTO‑add‑model.md for ModelBase and new model classes (#14874) This patch updates the example in docs/development/HOWTO-add-model.md to reflect recent changes after TextModel and MmprojModel were introduced. It replaces the outdated Model base class with TextModel or MmprojModel and updates the registration example accordingly. Signed-off-by: Wook Song <wook16.song@samsung.com>10 个月前
model : support MiniCPM-V 4.5 (#15575) 9 个月前
ggml-zdnn: fix #15414, activate FP16 and BF16 acceleration and incorrect zTensor free (#15839) 8 个月前
repo : update links to new url (#11886) * repo : update links to new url ggml-ci * cont : more urls ggml-ci1 年前
ggml-zdnn: fix #15414, activate FP16 and BF16 acceleration and incorrect zTensor free (#15839) 8 个月前
Update build.md to remove MSVC arm64 notes (#15684) Removed information about MSVC compiler limitations for arm64 builds.9 个月前
musa: upgrade musa sdk to rc4.2.0 (#14498) * musa: apply mublas API changes Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: update musa version to 4.2.0 Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: restore MUSA graph settings in CMakeLists.txt Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: disable mudnnMemcpyAsync by default Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: switch back to non-mudnn images Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * minor changes Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: restore rc in docker image tag Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> --------- Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>10 个月前
server : add documentation for parallel_tool_calls param (#15647) Co-authored-by: Pierre F <no@p.e>9 个月前
docs : add "Quick start" section for new users (#13862) * docs : add "Quick start" section for non-technical users * rm flox * Update README.md11 个月前
llguidance build fixes for Windows (#11664) * setup windows linking for llguidance; thanks @phil-scott-78 * add build instructions for windows and update script link * change VS Community link from DE to EN * whitespace fix1 年前
mtmd : add support for Voxtral (#14862) * mtmd : add support for Voxtral * clean up * fix python requirements * add [BEGIN_AUDIO] token * also support Devstral conversion * add docs and tests * fix regression for ultravox * minor coding style improvement * correct project activation fn * Apply suggestions from code review Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> --------- Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>10 个月前
ggml-zdnn: fix #15414, activate FP16 and BF16 acceleration and incorrect zTensor free (#15839) 8 个月前