mzbac / GPTQ-for-LLaMa-API
Provide a way to use the GPT-QLLama model as an API
☆43Updated last year
Alternatives and similar repositories for GPTQ-for-LLaMa-API:
Users that are interested in GPTQ-for-LLaMa-API are comparing it to the libraries listed below
- Harnessing the Memory Power of the Camelids☆146Updated last year
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆70Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- Porting BabyAGI to Oobabooba.☆33Updated last year
- ☆41Updated last year
- An autonomous AI agent extension for Oobabooga's web ui☆175Updated last year
- Client-side toolkit for using large language models, including where self-hosted☆107Updated 4 months ago
- ☆37Updated last year
- Local LLaMAs/Models in VSCode☆53Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Example of calling OpenRouter from a Streamit app☆94Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Local LLM ReAct Agent with Guidance☆158Updated last year
- This code implements a Local LLM Selector from the list of Local Installed Ollama LLMs for your specific user Query☆102Updated last year
- A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS☆92Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- Let's create synthetic textbooks together :)☆74Updated last year
- Build your Swarm of Internet Agents using MultiOn 🚀☆78Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- ☆53Updated last year
- ☆63Updated 4 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆99Updated 5 months ago