Lizonghang / prima.cpp

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters
260Updated this week

Alternatives and similar repositories for prima.cpp:

Users that are interested in prima.cpp are comparing it to the libraries listed below