Tags · Patater/llama.cpp

b3504

cann: Fix ggml_cann_im2col for 1D im2col (ggml-org#8819)

* fix ggml_cann_im2col for 1D im2col

* fix build warning
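
For readers unfamiliar with the operation named in this fix, here is a minimal sketch of what a 1D im2col computes. It is an illustration in plain Python, not the CANN backend code; the function names `im2col_1d` and `conv1d_via_im2col` and their parameters are assumptions made for this example.

```python
def im2col_1d(signal, kernel_size, stride=1, padding=0, dilation=1):
    """Unfold a 1D signal into rows, one kernel-sized window per output position.

    Zero padding is applied on both ends. Hypothetical helper for illustration.
    """
    padded = [0.0] * padding + list(signal) + [0.0] * padding
    span = dilation * (kernel_size - 1) + 1          # input extent covered by one window
    out_len = (len(padded) - span) // stride + 1     # number of output positions
    return [
        [padded[i * stride + k * dilation] for k in range(kernel_size)]
        for i in range(out_len)                      # empty when the signal is too short
    ]


def conv1d_via_im2col(signal, kernel, **kwargs):
    """1D convolution (cross-correlation) as a dot product over unfolded rows."""
    rows = im2col_1d(signal, len(kernel), **kwargs)
    return [sum(x * w for x, w in zip(row, kernel)) for row in rows]
```

Once the input is unfolded this way, the convolution reduces to a matrix product between the unfolded rows and the kernel, which is why backends implement im2col as a separate operator.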

Aug 2, 2024
e09a800
zip
tar.gz

b3503

[SYCL] Fixing wrong VDR iq4nl value (ggml-org#8812)

Aug 2, 2024
0fbbd88
zip
tar.gz

b3502

ggml-cuda: Adding support for unified memory (ggml-org#8035)

* Adding support for unified memory

* adding again the documentation about unified memory

* refactoring: Moved the unified memory code in the correct location.

* Fixed compilation error when using hipblas

* cleaning up the documentation

* Updating the documentation

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* adding one more case where the PR should not be enabled

---------

Co-authored-by: matteo serva <matteo.serva@gmail.com>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

Aug 1, 2024
afbb4c1
zip
tar.gz

b3501

Build: Only include execinfo.h on linux systems that support it (ggml-org#8783)

* Only enable backtrace on GLIBC linux systems

* fix missing file from copy

* use glibc macro instead of defining a custom one

Aug 1, 2024
b7a08fd
zip
tar.gz

b3500

cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (ggml-org#8800)

* cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X

* update asserts

* only use dmmv for supported types

* add test
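
The column requirement in the title can be pictured as a simple divisibility check. The sketch below assumes `GGML_CUDA_DMMV_X` is 32 and models only the column condition; the real dispatch logic in ggml-cuda also considers tensor types and other conditions, and the function name `dmmv_cols_ok` is hypothetical.

```python
GGML_CUDA_DMMV_X = 32  # assumed value for illustration; it is a build-time setting


def dmmv_cols_ok(ncols, dmmv_x=GGML_CUDA_DMMV_X):
    """Return True if the row length is a multiple of 2 * dmmv_x.

    Models only the column requirement named in the commit title.
    """
    return ncols % (2 * dmmv_x) == 0
```

Under these assumptions a row of 4096 columns passes the check, while a row of 96 columns does not and would need a different kernel.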

Aug 1, 2024
7a11eb3
zip
tar.gz

b3499

cann: support q8_0 for Ascend backend (ggml-org#8805)

Aug 1, 2024
c8a0090
zip
tar.gz

b3498

server : update llama-server embedding flag documentation (ggml-org#8779)

Fixes ggml-org#8763

Jul 31, 2024
afbbcf3
zip
tar.gz

b3497

Build: Fix potential race condition (ggml-org#8781)

* Fix potential race condition as pointed out by @fairydreaming in ggml-org#8776

* Reference the .o rather than rebuilding every time.

* Adding in CXXFLAGS and LDFLAGS

* Removing unnecessary linker flags.

Jul 31, 2024
ed9d285
zip
tar.gz

b3496

Adding Gemma 2 2B configs (ggml-org#8784)

* Adding Gemma 2 2B configs

Updates to Q scaling and Gemma 2 model sizes to match v2 2B model.

* Update src/llama.cpp

Co-authored-by: slaren <slarengh@gmail.com>

---------

Co-authored-by: slaren <slarengh@gmail.com>

Jul 31, 2024
398ede5
zip
tar.gz

b3495

cmake : fix use of external ggml (ggml-org#8787)

Jul 31, 2024
44d28dd
zip
tar.gz
