Skip to content

Tags: Patater/llama.cpp

Tags

b3504

Toggle b3504's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cann: Fix ggml_cann_im2col for 1D im2col (ggml-org#8819)

* fix ggml_cann_im2col for 1D im2col

* fix build warning

b3503

Toggle b3503's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[SYCL] Fixing wrong VDR iq4nl value (ggml-org#8812)

b3502

Toggle b3502's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml-cuda: Adding support for unified memory (ggml-org#8035)

* Adding support for unified memory

* adding again the documentation about unified memory

* refactoring: Moved the unified memory code in the correct location.

* Fixed compilation error when using hipblas

* cleaning up the documentation

* Updating the documentation

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* adding one more case where the PR should not be enabled

---------

Co-authored-by: matteo serva <matteo.serva@gmail.com>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

b3501

Toggle b3501's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Build: Only include execinfo.h on linux systems that support it (ggml…

…-org#8783)

* Only enable backtrace on GLIBC linux systems

* fix missing file from copy

* use glibc macro instead of defining a custom one

b3500

Toggle b3500's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (ggml-org#8800)

* cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X

* update asserts

* only use dmmv for supported types

* add test

b3499

Toggle b3499's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cann: support q8_0 for Ascend� backend (ggml-org#8805)

b3498

Toggle b3498's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
server : update llama-server embedding flag documentation (ggml-org#8779

)

Fixes ggml-org#8763

b3497

Toggle b3497's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Build: Fix potential race condition (ggml-org#8781)

* Fix potential race condition as pointed out by @fairydreaming in ggml-org#8776

* Reference the .o rather than rebuilding every time.

* Adding in CXXFLAGS and LDFLAGS

* Removing unnecessary linker flags.

b3496

Toggle b3496's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Adding Gemma 2 2B configs (ggml-org#8784)

* Adding Gemma 2 2B configs

Updates to Q scaling and Gemma 2 model sizes to match v2 2B model.

* Update src/llama.cpp

Co-authored-by: slaren <slarengh@gmail.com>

---------

Co-authored-by: slaren <slarengh@gmail.com>

b3495

Toggle b3495's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cmake : fix use of external ggml (ggml-org#8787)