seonglae / llama2gptqLinks

Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.

☆30

Alternatives and similar repositories for llama2gptq

Users that are interested in llama2gptq are comparing it to the libraries listed below

Sorting:

deep-diver / PingPong
manage histories of LLM applied applications
☆91Updated last year
amrrs / LLM-QA-Bot
☆64Updated 2 years ago
Marker-Inc-Korea / RAGchain
Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...
☆281Updated last year
deep-diver / gpt2-ft-pipeline
GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended
☆32Updated last year
KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
deep-diver / gradio-chat
HuggingChat like UI in Gradio
☆71Updated 2 years ago
deep-diver / Vid2Persona
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
☆47Updated last year
AIAnytime / Zephyr-7B-beta-RAG-Demo
Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.
☆35Updated last year
nodematiclabs / llama-3-finetune-unsloth
☆14Updated last year
hwchase17 / conversational-retrieval-agent
☆61Updated 2 years ago
bdytx5 / mistral7B_finetune
fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI
☆38Updated last year
yujonglee / eval
Evaluate your LLM apps, RAG pipeline, any generated text, and more!
☆1Updated last year
hwchase17 / langchain-gradio-template
☆136Updated 2 years ago
gururise / openai_text_generation_inference_server
Use OpenAI with HuggingChat by emulating the text_generation_inference_server
☆44Updated 2 years ago
edumunozsala / llama-2-7B-4bit-python-coder
Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..
☆64Updated last year
fredliubojin / langchain_gradio
LiveQuery GPT-4: chatbot with GPT-4-powered convos & Google-powered real-time search
☆85Updated 2 years ago
deep-diver / paperqa-ui
☆17Updated last year
langchain-ai / langchain-upstage
☆13Updated this week
ainbr / chatgpt-weak-labeler-webui
Weak Labeling (NER) using ChatGPT
☆38Updated 2 years ago
aju22 / DocumentGPT
DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search …
☆120Updated last year
davidkim205 / nox
Efficient fine-tuning for ko-llm models
☆182Updated last year
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104Updated 2 months ago
francisjervis / Flask-Langchain
A Flask extension to manage Langchain chat memory and document stores in Flaask apps.
☆71Updated 2 years ago
hunkim / es-gpt
☆212Updated 2 years ago
streamlit / StreamlitLangChain
☆59Updated last year
wangermeng2021 / llm-webui
A Gradio web UI for Large Language Models. Supports LoRA/QLoRA finetuning,RAG(Retrieval-augmented generation) and Chat
☆36Updated last year
StableFluffy / EasyLLMFeaturePorter
1-Click is all you need.
☆62Updated last year
KennethanCeyer / awesome-llmops
Awesome series for LLMOps
☆47Updated 4 months ago
georgesung / LLM-WikipediaQA
Document Q&A on Wikipedia articles using LLMs
☆78Updated last year
experienced-dev / chatgpt-plugin-fastapi-langchain-chroma
An Example Plugin for ChatGPT, Utilizing FastAPI, LangChain and Chroma
☆49Updated 2 years ago