seonglae / llama2gptq
Chat with Llama 2 and get responses backed by reference documents retrieved from a vector database. Runs locally using a GPTQ 4-bit quantized model.
☆30 · Updated last year
Alternatives and similar repositories for llama2gptq
Users who are interested in llama2gptq are comparing it to the repositories listed below.
- Manage histories of LLM-based applications ☆91 · Updated last year
- ☆64 · Updated 2 years ago
- Extension of LangChain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on... ☆281 · Updated last year
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended ☆32 · Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Updated last year
- HuggingChat-like UI in Gradio ☆71 · Updated 2 years ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆47 · Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM. ☆35 · Updated last year
- ☆14 · Updated last year
- ☆61 · Updated 2 years ago
- Fine-tuning Mistral 7B using Hugging Face, Weights and Biases, Choline, and Vast AI ☆38 · Updated last year
- Evaluate your LLM apps, RAG pipeline, any generated text, and more! ☆1 · Updated last year
- ☆136 · Updated 2 years ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server ☆44 · Updated 2 years ago
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolotl,.. ☆64 · Updated last year
- LiveQuery GPT-4: chatbot with GPT-4-powered convos & Google-powered real-time search ☆85 · Updated 2 years ago
- ☆17 · Updated last year
- ☆13 · Updated this week
- Weak Labeling (NER) using ChatGPT ☆38 · Updated 2 years ago
- DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search … ☆120 · Updated last year
- Efficient fine-tuning for ko-llm models ☆182 · Updated last year
- Fine-tune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆104 · Updated 2 months ago
- A Flask extension to manage LangChain chat memory and document stores in Flask apps. ☆71 · Updated 2 years ago
- ☆212 · Updated 2 years ago
- ☆59 · Updated last year
- A Gradio web UI for Large Language Models. Supports LoRA/QLoRA fine-tuning, RAG (retrieval-augmented generation), and chat ☆36 · Updated last year
- 1-Click is all you need. ☆62 · Updated last year
- Awesome series for LLMOps ☆47 · Updated 4 months ago
- Document Q&A on Wikipedia articles using LLMs ☆78 · Updated last year
- An Example Plugin for ChatGPT, Utilizing FastAPI, LangChain and Chroma ☆49 · Updated 2 years ago