Tags · MyselfTry/llama.cpp

b5306

sync : ggml

ggml-ci

May 7, 2025
d879433
zip
tar.gz

b5303

llama : deci : support ffn-free with attention (ggml-org#13296)

May 7, 2025
bc4e112
zip
tar.gz

b5302

common : Add a warning when we can't match samplers from a string or char. (ggml-org#13330)

May 7, 2025
39e73ae
zip
tar.gz

b5301

cuda : remove nrows_x in mul_mat_q_process_tile (ggml-org#13325)

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

May 7, 2025
1f73301
zip
tar.gz

b5300

examples : remove infill (ggml-org#13283)

ggml-ci

May 7, 2025
4773d7a
zip
tar.gz

b5299

llama : support tie embedding for chatglm models (ggml-org#13328)

May 7, 2025
6c7fd67
zip
tar.gz

gguf-v0.16.3

Version 0.16.3 release

May 6, 2025
a7366fa
zip
tar.gz

b5298

CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (ggml-org#13135)

May 6, 2025
141a908
zip
tar.gz

b5297

clip : refactor graph builder (ggml-org#13321)

* mtmd : refactor graph builder

* fix qwen2vl

* clean up siglip cgraph

* pixtral migrated

* move minicpmv to a dedicated build function

* move max_feature_layer to build_llava

* use build_attn for minicpm resampler

* fix windows build

* add comment for batch_size

* also support tinygemma3 test model

* qwen2vl does not use RMS norm

* fix qwen2vl norm (2)

May 6, 2025
32916a4
zip
tar.gz

b5296

sampling : make top_n_sigma no-op at <=0 or a single candidate (ggml-org#13345)

May 6, 2025
ffc7272
zip
tar.gz
