Anbeeld / beellama.cppView on GitHub
DFlash & TurboQuant in llama.cpp with up to 3x faster generation and 7.5x more KV cache in same VRAM
360May 18, 2026Updated this week

Alternatives and similar repositories for beellama.cpp

Users that are interested in beellama.cpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?