seonglae / llama2gptq
Chat with LLaMA 2 and get responses backed by reference documents retrieved from a vector database. Uses a locally available model with GPTQ 4-bit quantization.
☆31Updated 2 years ago
Alternatives and similar repositories for llama2gptq
Users who are interested in llama2gptq are comparing it to the libraries listed below.
- Manage histories of LLM-powered applications☆91Updated 2 years ago
- HuggingChat-like UI in Gradio☆70Updated 2 years ago
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆33Updated 2 years ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆45Updated 2 years ago
- ☆36Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆36Updated 2 years ago
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...☆284Updated last year
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolotl,..☆64Updated last year
- Awesome series for LLMOps☆52Updated 9 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 6 months ago
- Document Q&A on Wikipedia articles using LLMs☆79Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- A Gradio web UI for Large Language Models. Supports LoRA/QLoRA finetuning, RAG (retrieval-augmented generation), and chat☆37Updated 2 years ago
- ☆37Updated 2 years ago
- ☆64Updated 2 years ago
- ☆15Updated 3 weeks ago
- Efficient fine-tuning for ko-llm models☆184Updated last year
- ☆137Updated 2 years ago
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Updated last year
- This repo contains code for Langchain tutorials on my youtube channel.☆41Updated 2 years ago
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.☆38Updated 2 years ago
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆16Updated 2 years ago
- Awesome series for Large Language Models (LLMs)☆81Updated 9 months ago
- DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search …☆121Updated 2 years ago
- GGUF Quantization of any LLM.☆41Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- ☆18Updated last year
- LiveQuery GPT-4: chatbot with GPT-4-powered convos & Google-powered real-time search☆85Updated 2 years ago
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆86Updated 2 years ago