seonglae / llama2gptqLinks
Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.
☆30Updated last year
Alternatives and similar repositories for llama2gptq
Users that are interested in llama2gptq are comparing it to the libraries listed below
Sorting:
- ☆64Updated 2 years ago
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆33Updated 2 years ago
- manage histories of LLM applied applications☆91Updated last year
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆45Updated 2 years ago
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...☆282Updated last year
- ☆137Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆16Updated 2 years ago
- ☆35Updated last year
- HuggingChat like UI in Gradio☆71Updated 2 years ago
- Document Q&A on Wikipedia articles using LLMs☆79Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆47Updated last year
- Efficient fine-tuning for ko-llm models☆182Updated last year
- 1-Click is all you need.☆62Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Updated last year
- ☆37Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 3 months ago
- Awesome series for LLMOps☆47Updated 5 months ago
- fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI☆38Updated last year
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..☆64Updated last year
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆22Updated last year
- ☆61Updated 2 years ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆35Updated last year
- Python monorepo template with Pants☆21Updated 2 years ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated 2 years ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆37Updated last year
- LiveQuery GPT-4: chatbot with GPT-4-powered convos & Google-powered real-time search☆85Updated 2 years ago
- Weak Labeling (NER) using ChatGPT☆38Updated 2 years ago
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Updated last year
- a Jax/Flax inference code of StarCoder☆12Updated 2 years ago