
Comparing changes

base repository: abetlen/llama-cpp-python (base: main)
head repository: kitaekatt/llama-cpp-python (compare: main)
  • 2 commits
  • 4 files changed
  • 2 contributors

Commits on Nov 18, 2025

  1. Support latest llama.cpp with nemotron_h architecture and graceful deprecated symbol handling
    
    - Update vendor/llama.cpp to latest main branch for nemotron_h architecture support
    - Disable mtmd build in CMakeLists.txt: latest llama.cpp has CMake compatibility issues with mtmd module that prevent build completion. mtmd is not required for nemotron_h.
    - Add graceful deprecated symbol handling in _ctypes_extensions.py: Wrap getattr() in try/except to handle missing C symbols from deprecated functions removed in latest llama.cpp. Returns stub functions instead of hard failures, allowing import to succeed.
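    The graceful-fallback approach described above can be sketched as follows. This is a minimal illustration of the pattern (wrapping `getattr()` on the loaded shared library in `try`/`except` and substituting a stub), not the fork's exact code in `_ctypes_extensions.py`; the function and symbol names here are hypothetical.

    ```python
    import ctypes


    def safe_bind(lib, name: str):
        """Return the C symbol from the loaded library if it exists;
        otherwise return a stub that raises only when actually called,
        so that module import itself never fails on removed symbols."""
        try:
            return getattr(lib, name)
        except AttributeError:
            def _stub(*args, **kwargs):
                raise NotImplementedError(
                    f"{name} was removed from the loaded llama.cpp build"
                )
            return _stub
    ```

    Binding every function through a helper like this means a llama.cpp update that drops a deprecated symbol degrades to a runtime error at the call site instead of an `AttributeError` at import time.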
    
    Result: nemotron-nano-12b-gguf now loads and benchmarks successfully
    - Model architecture: nemotron_h (Mamba-2 hybrid)
    - Benchmark speed: 18.9 tokens/sec
    - Test status: PASS (5/5 prompts validated)
    
    🤖 Generated with Claude Code
    
    Co-Authored-By: Claude <noreply@anthropic.com>
    kitaekatt and claude committed Nov 18, 2025
    Commit: 7f691e2

Commits on Nov 19, 2025

  1. docs: Add CLAUDE.md for fork setup and RTX 5090 optimization guidance

    - Document fork relationship with abetlen/llama-cpp-python upstream
    - Add build instructions with CMAKE_CUDA_ARCHITECTURES=120 for SM 12.0
    - Explain integration with llm-dev project
    - Include common tasks and troubleshooting steps
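    The build flag mentioned above can be applied like this. A sketch, assuming llama-cpp-python's documented `CMAKE_ARGS` convention for passing CMake options through pip, a CUDA toolchain recent enough for SM 12.0, and that `-DGGML_CUDA=on` is the CUDA enable flag in the vendored llama.cpp; the install command is shown commented rather than run.

    ```shell
    # Target SM 12.0 (RTX 5090) explicitly instead of relying on autodetection
    export CMAKE_ARGS="-DGGML_CUDA=on -DCMAKE_CUDA_ARCHITECTURES=120"
    # then, from the repository root:
    # pip install -e . --no-cache-dir
    echo "$CMAKE_ARGS"
    ```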
    kitaekatt committed Nov 19, 2025
    Commit: a16ebac