Releases: ggml-org/llama.cpp

b9553

07 Jun 21:10
9e3b928

common : relax sampler name matching (#23744)

  • common : relax sampler name matching

Currently, the alternative names for samplers (like top-k and min-p
instead of the canonical top_k and min_p) are not always recognized
by the common_sampler_types_from_names function in
common/sampling.cpp.

This PR changes the signature of this function to remove the
bool allow_alt_names flag, and removes all occurrences of the flag
from call sites. Therefore, the function will now always match all
known names.

I also changed the logic of the function to unconditionally check the
provided sampler names against both the canonical and alternative names,
and to be case-insensitive.

This fixes an issue I was seeing wherein samplers specified in the
llama-server UI were not recognized as valid when the alternative
names were used.

  • add more alt names

  • cont. fix

  • cast to unsigned char for correctness

  • common : unify sampler name mapping

  • annotate canonical vs. alt sampler name mappings per @CISC

  • Update common/sampling.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • common : auto-generate sampler name aliases per @ngxson

  • use merged map for matching

  • use .merge instead of iterating

  • nit: simplify comment

  • nit: use insert everywhere, not index assignment


Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
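
The matching approach described in the commit list above (case-insensitive lookup, auto-generated aliases, one merged map) can be sketched roughly as follows. This is an illustration only, not the code from common/sampling.cpp: the sampler_type enum, to_lower, make_aliases, build_name_map and types_from_names are hypothetical names, and the alias rule (swap or drop underscores) is an assumption about what "auto-generate" might mean. It needs C++17.

```cpp
#include <algorithm>
#include <cassert>
#include <cctype>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for the real sampler enum.
enum class sampler_type { top_k, top_p, min_p, temperature };

using name_map = std::unordered_map<std::string, sampler_type>;

// Lowercase a string. The lambda takes unsigned char because passing a
// negative char value to std::tolower is undefined behavior.
static std::string to_lower(std::string s) {
    std::transform(s.begin(), s.end(), s.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    return s;
}

// Assumed alias rule: "top_k" yields "top-k" and "topk".
static std::vector<std::string> make_aliases(const std::string & canonical) {
    assert(!canonical.empty());
    std::string dashed = canonical;
    std::replace(dashed.begin(), dashed.end(), '_', '-');
    std::string joined = canonical;
    joined.erase(std::remove(joined.begin(), joined.end(), '_'), joined.end());
    return { dashed, joined };
}

// Build one lookup table holding canonical names and generated aliases.
static name_map build_name_map() {
    name_map canonical;
    canonical.insert({"top_k",       sampler_type::top_k});
    canonical.insert({"top_p",       sampler_type::top_p});
    canonical.insert({"min_p",       sampler_type::min_p});
    canonical.insert({"temperature", sampler_type::temperature});

    name_map aliases;
    for (const auto & [name, type] : canonical) {
        for (const auto & alias : make_aliases(name)) {
            aliases.insert({alias, type}); // insert never overwrites an existing key
        }
    }

    name_map merged = canonical;
    merged.merge(aliases); // moves over only the keys not already present
    return merged;
}

// Resolve user-supplied names; unknown names are skipped in this sketch.
static std::vector<sampler_type> types_from_names(const std::vector<std::string> & names) {
    static const name_map known = build_name_map();
    std::vector<sampler_type> out;
    for (const auto & name : names) {
        const auto it = known.find(to_lower(name));
        if (it != known.end()) {
            out.push_back(it->second);
        }
    }
    return out;
}
```

With this sketch, a call such as types_from_names({"Top-K", "min_p", "TOPP", "bogus"}) resolves the first three names and drops the last. Using insert rather than index assignment means a canonical name is never silently overwritten by a colliding alias.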

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9551

07 Jun 19:27
379ac66

kv-cache : avoid kv cells copies (#24277)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9550

07 Jun 15:55
f0156d1

kv-cache: follow the source cache size when sharing cells (#24267)

A fitted target context can end up smaller than the draft default. The
oversized assistant views then overflow the shared K/V tensors and trip
the ggml_view_4d size assert during graph reserve.
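
The failure mode described above can be illustrated with a small bounds check. This is a sketch under stated assumptions, not the actual patch: kv_buffer, view_elements, view_fits and shared_view_cells are hypothetical names, and the real code works on ggml tensors rather than plain element counts.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical description of a K or V buffer owned by the source cache.
struct kv_buffer {
    int64_t n_embd;  // elements stored per cell
    int64_t kv_size; // number of cells actually allocated
};

// Number of elements a view over n_cells cells would cover.
static int64_t view_elements(int64_t n_embd, int64_t n_cells) {
    assert(n_embd >= 0 && n_cells >= 0);
    return n_embd * n_cells;
}

// A view is only valid if it stays inside the buffer it aliases.
static bool view_fits(const kv_buffer & buf, int64_t view_cells) {
    return view_elements(buf.n_embd, view_cells) <= view_elements(buf.n_embd, buf.kv_size);
}

// When sharing cells, size the views from the source cache and ignore the
// sharing side's own default, which may be larger than what was allocated.
static int64_t shared_view_cells(const kv_buffer & source, int64_t own_default_cells) {
    (void) own_default_cells; // deliberately unused in this sketch
    return source.kv_size;
}
```

For example, with a source buffer of 2048 cells and a draft default of 4096, view_fits rejects the 4096-cell view, while the size returned by shared_view_cells always fits.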

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9549

07 Jun 13:38
04eb4c4

llama : add Gemma4 MTP (#23398)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9548

07 Jun 12:10
8a091c4

spec : fix vocab compatibility check (#24256)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9547

07 Jun 09:38
465b1f0

arg: Skip mmproj download when user supplied mmproj (#24239)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9544

06 Jun 21:20
98d5e8b

common/chat : fix LFM2/LFM2.5 reasoning round-trip and leak (#24234)

  • common/chat : fix LFM2 reasoning round-trip and stray leak
  • Gate by reasoning format and whether the template supports

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9543

06 Jun 19:48
31e8249

mtmd: support "frame merge" for qwen-vl-based models (#21858)

  • feat: add video support for Qwen3.5

  • various clean up

  • revise the design

  • fix llava-uhd case

  • nits

  • nits 2


Co-authored-by: andrewmd5 1297077+andrewmd5@users.noreply.github.com

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9542

06 Jun 10:47
6b80c74

completion : remove useless statics (#24226)

Signed-off-by: Adrien Gallouët angt@huggingface.co

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

b9541

06 Jun 09:54
588f0dc

completion : fix format specifier in LOG_INF (#24213)

Signed-off-by: Adrien Gallouët angt@huggingface.co

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI: