4 bits quantization of LLaMa using GPTQ
☆131Jun 3, 2023Updated 3 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa
Users who are interested in GPTQ-for-LLaMa are comparing it to the libraries listed below.
- Stable Diffusion web UI☆11Sep 13, 2022Updated 3 years ago
- Easiest 1-click way to install and use Stable Diffusion on your computer. Provides a browser UI for generating images from text prompts a…☆25Apr 18, 2023Updated 3 years ago
- ☆38Apr 14, 2026Updated last month
- 4 bits quantization of LLaMA using GPTQ☆3,073Jul 13, 2024Updated last year
- ☆13Oct 22, 2023Updated 2 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆11Apr 26, 2023Updated 3 years ago
- ☆10Jul 25, 2023Updated 2 years ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- Various scripts for working with local LLMs☆16Oct 19, 2023Updated 2 years ago
- ☆12Jun 13, 2023Updated 2 years ago
- Simplified installers for oobabooga/text-generation-webui.☆566Sep 23, 2023Updated 2 years ago
- Flexible Python package for managing and extending LLM based agents☆24May 14, 2023Updated 3 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 4 months ago
- 🥷 The open framework for building AI Assistants☆18Feb 21, 2025Updated last year
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆23Sep 24, 2023Updated 2 years ago
- C/C++ implementation of PygmalionAI/pygmalion-6b☆55Apr 18, 2023Updated 3 years ago
- ☆22Sep 2, 2023Updated 2 years ago
- Checker of "enable" statuses in SD Web UI☆59Aug 27, 2025Updated 9 months ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32May 25, 2023Updated 3 years ago
- ☆34May 17, 2023Updated 3 years ago
- ☆15Apr 22, 2023Updated 3 years ago
- Book Quick Starter Kit - Write Your Own Book in Plain Text (with Markdown)☆20Jul 4, 2016Updated 9 years ago
- ☆42Mar 12, 2023Updated 3 years ago
- A Gradio UI for XTTSv2 and RVC.☆65Sep 26, 2024Updated last year
- Dreambooth for colab☆31Dec 25, 2023Updated 2 years ago
- mov2mov extension for AUTOMATIC1111/stable-diffusion-webui☆27Sep 13, 2023Updated 2 years ago
- Image postprocessing extension for stable diffusion webui.☆54Sep 16, 2023Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆62Apr 30, 2023Updated 3 years ago
- ☆28Jun 26, 2023Updated 2 years ago
- Containerized DCSS Server☆16Updated this week
- Webui for using XTTS and for finetuning it☆114Sep 22, 2024Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference☆24May 18, 2023Updated 3 years ago
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆33Jul 14, 2023Updated 2 years ago
- Simple, expressive, pythonic datatypes for manipulating curves parameterized by keyframes and interpolators.☆37Dec 14, 2023Updated 2 years ago
- controlnet v1.1 colab☆88May 29, 2024Updated 2 years ago
- Simple Autogpt with tree of thoughts☆14May 25, 2023Updated 3 years ago
- Fast and memory-efficient exact attention - Windows wheels☆32Mar 3, 2024Updated 2 years ago
- An AI-powered security analysis tool for web applications that combines Large Language Model (LLM) analysis with intelligent agent-based …☆46Jul 26, 2025Updated 10 months ago