seonglae / llama2gptq

Chat with LLaMa 2 and receive responses accompanied by reference documents retrieved from a vector database. The model runs locally using GPTQ 4-bit quantization.
29 · Updated last year

Alternatives and similar repositories for llama2gptq:

Users who are interested in llama2gptq are comparing it to the libraries listed below.