seonglae / llama2gptq

Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.
29Updated last year

Alternatives and similar repositories for llama2gptq

Users that are interested in llama2gptq are comparing it to the libraries listed below

Sorting: