mzbac / AutoGPTQ-API
Host a GPTQ model using AutoGPTQ as an API that is compatible with the text generation UI API.
☆91 Updated last year
Related projects
Alternatives and complementary repositories for AutoGPTQ-API
- Visual Studio Code extension for WizardCoder ☆144 Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything! ☆91 Updated 4 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- An autonomous AI agent extension for Oobabooga's web ui ☆175 Updated last year
- Python bindings for the C++ port of the GPT4All-J model. ☆38 Updated last year
- Extension for using alternative GitHub Copilot (StarCoder API) in VSCode ☆99 Updated 7 months ago
- Local LLM ReAct Agent with Guidance ☆154 Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆35 Updated last year
- An Extension for oobabooga/text-generation-webui ☆36 Updated last year
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙 ☆34 Updated last year
- Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside LangChain or other agents. Contains Oobabooga and Kobol… ☆211 Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆68 Updated last month
- Creates a LangChain agent that works using the WebUI's API and Wikipedia ☆73 Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 Updated last year
- ☆39 Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓 ☆90 Updated 8 months ago
- Use a local LLaMA LLM or OpenAI to chat about, discuss, or summarize your documents, YouTube videos, and so on. ☆152 Updated 6 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆112 Updated last year
- oobabooga text-generation-webui implementation of wafflecomposite's langchain-ask-pdf-local ☆67 Updated last year
- Prompt-Promptor is a Python library for automatically generating prompts using LLMs ☆67 Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference ☆24 Updated last year
- GPT-2 small trained on phi-like data ☆65 Updated 8 months ago
- A prompt/context management system ☆165 Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code. ☆53 Updated last year
- Harnessing the Memory Power of the Camelids ☆145 Updated last year
- ☆55 Updated last year
- StarCoder server for huggingface-vscode custom endpoint ☆167 Updated 11 months ago