turboderp-org / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs
3,845Updated last week

Alternatives and similar repositories for exllamav2:

Users that are interested in exllamav2 are comparing it to the libraries listed below