Comparing changes
base repository: AlphaKure/llama-cpp-python
base: main
head repository: abetlen/llama-cpp-python
compare: main
- 19 commits
- 34 files changed
- 4 contributors
Commits on Mar 22, 2026
- ca3b00a fix(ci): Rename huggingface-cli to hf (abetlen#2149)
  * Fix model download in test workflow
  * Use hf CLI in test workflow
  * Use hf CLI name in CI and docs
  * Reference PR in changelog
- 9f661ff fix(ci): Fix macos tests, support both Intel and Apple Silicon testing (abetlen#2150)
  * fix(ci): use supported macos runner label
  * fix(ci): add apple silicon macos test coverage
  * fix(ci): run standard macos tests on apple silicon
  * fix(ci): simplify apple silicon macos install
  * fix(ci): disable ggml native on apple silicon runner
  * docs: update changelog for macos ci runner fix
- a9b4a06 misc: Add Ruff formatting (abetlen#2148)
  * Add Ruff formatting and safe lint baseline
  * Update changelog for Ruff setup
Commits on Mar 23, 2026
- 18aa31e feat: Update llama.cpp to ggml-org/llama.cpp@49bfdde (abetlen#2151)
  * Update llama.cpp and sync bindings
  * Clean up binding compatibility shims
  * Remove flash attention property shim
  * Remove mtmd verbosity shim
  * Add docstrings for new bindings
  * Format Ruff files and add changelog entry
- e1f8ac0 ci: add riscv64 wheel builds to release workflow (abetlen#2139)
  * ci: add riscv64 wheel builds to release workflow
    Add a build_wheels_riscv64 job mirroring the existing arm64 QEMU-based build. Uses cibuildwheel with QEMU emulation for linux/riscv64, targeting CPython 3.10-3.14 on manylinux.
    Closes abetlen#2138
  * ci: use cibuildwheel 3.1.2 for riscv64 wheels
  * docs: update changelog for riscv64 wheel PR
  Co-authored-by: abetlen <abetlen@gmail.com>
- 11e7a55 fix: Qwen 3.5 support (abetlen#2152)
  * fix: handle Qwen 3.5 hybrid prefix reuse
  * test: fix Qwen runtime unit mocks
  * test: drop Qwen runtime unit tests
  * docs: credit Qwen fix contributors in changelog
  * docs/tests: update default Qwen model to 3.5 0.8B
  * test: rebaseline Qwen 3.5 outputs
  * test: stabilize low-level Qwen sampling check
  * test: tighten Qwen 3.5 completion prompts
- a6b1807
Commits on Mar 24, 2026
- f0391c5 fix(ci): release wheel workflow (abetlen#2154)
  * fix(ci): harden release wheel workflow
  * fix(ci): document and pin release wheel baselines
  * fix(ci): speed up release arch builds
  * fix(ci): split riscv64 by python version
  * fix(ci): sanitize riscv64 artifact names
- 909ebf1 fix(ci): cuda wheel workflow (abetlen#2155)
  * fix(ci): harden cuda wheel workflow
  * fix(ci): pin cuda toolkit versions accurately
  * fix(ci): resolve exact cuda toolkit installs
  * fix(ci): align cuda toolkit roots and tags
  * fix(ci): pin cuda packages to nvidia label
  * fix(ci): allow cuda solver to mix non-cuda deps
- ccc6bc0 fix(ci): docker build workflow (abetlen#2156)
  * fix(ci): harden docker build workflow
  * docs: update changelog for ci workflows
- 7b38c31 feat: expose attention_type parameter in Llama.__init__ (abetlen#2143)
  * feat: expose attention_type parameter in Llama.__init__
  * docs: preserve attention_type in pickled state
  * docs: update changelog for attention_type
  Co-authored-by: Victor Biederbeck <victor@moria.hiddencove.xyz>
  Co-authored-by: abetlen <abetlen@gmail.com>
- d6f46a5
Commits on Mar 25, 2026
- 5f9c231 fix(ci): reduce CUDA binary wheel size only including cubins for current arches and one PTX target for forward compatibility (abetlen#2158)
  * fix(ci): shrink CUDA wheel fatbins
  * docs: update changelog for cuda wheel size fix
- ac59e5a fix: handle embedding models without KV memory (abetlen#2160)
  * Fix embedding models without KV memory
  * Add changelog entry for embedding memory fix
- c670222 feat: Update llama.cpp to ggml-org/llama.cpp@c0159f9 (abetlen#2161)
  * Update llama.cpp to c0159f9c1
  * Add changelog entry for llama.cpp update
- f54421b
Commits on Mar 29, 2026
- fcd932a fix(ci): publish distinct manylinux and musllinux cpu wheels (abetlen#2165)
  * fix(ci): publish distinct manylinux and musllinux cpu wheels
  * docs: add changelog entry for linux wheel repair fix
- 7613aca ci: publish release wheels as py3-none (abetlen#2166)
  * ci: publish CPU wheels as py3-none
  * docs: add changelog entry for py3-none wheel tags
Commits on Mar 30, 2026
- 7257ba9
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff main...main
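
Because both branches are named main, the suggested command compares a branch with itself when run in a single clone. A minimal sketch of one way around this, assuming you are in a clone of the base repository (AlphaKure/llama-cpp-python) and the head repository is reachable at its usual GitHub URL. The helper name `compare_with_upstream` and the remote name `upstream` are hypothetical choices for illustration, not something the page prescribes:

```shell
# Hypothetical helper: diff local main against the head repository's main.
# The remote name "upstream" and the default URL are assumptions.
compare_with_upstream() {
  remote="${1:-upstream}"
  url="${2:-https://github.com/abetlen/llama-cpp-python.git}"

  # Add the remote only if it is not configured yet.
  git remote get-url "$remote" >/dev/null 2>&1 || git remote add "$remote" "$url"

  # Fetch the head repository's main branch into $remote/main.
  git fetch "$remote" main

  # Three-dot diff: changes on $remote/main since its merge base with local main.
  git diff --stat "main...$remote/main"
}
```

Running `compare_with_upstream` inside the clone prints a per-file summary; dropping `--stat` shows the full diff.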