A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.
☆23Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa-CUDA
Users that are interested in GPTQ-for-LLaMa-CUDA are comparing it to the libraries listed below.
- ☆13Oct 30, 2023Updated 2 years ago
- Fast and memory-efficient exact attention - Windows wheels☆33Mar 3, 2024Updated 2 years ago
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆33Jul 14, 2023Updated 2 years ago
- 8-bit CUDA functions for PyTorch☆27Nov 18, 2023Updated 2 years ago
- ☆16Apr 23, 2024Updated 2 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Aug 2, 2023Updated 2 years ago
- ChatGPT CSS style☆14Apr 28, 2024Updated 2 years ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32May 25, 2023Updated 3 years ago
- Get more done with LLMs☆13Jan 19, 2024Updated 2 years ago
- Web page with political compass quiz results for open LLMs☆38Jan 31, 2024Updated 2 years ago
- LLM shell and document interrogator☆14Jul 24, 2023Updated 2 years ago
- Precompiled Wheels for GPTQ-for-LLaMa☆19Jul 26, 2023Updated 2 years ago
- ComfyUI node pack by cerspense☆37Dec 29, 2025Updated 5 months ago
- Generate Large Language Model text in a container.☆20Mar 24, 2023Updated 3 years ago
- Science-driven chatbot development☆65May 5, 2024Updated 2 years ago
- Minimalistic, pluggable Golang evloop/timer handler with dependency-injection☆16Aug 12, 2018Updated 7 years ago
- Retail is something like the Linux command tail, and supports "retail", which means one can tail a file using a pos file which saves the last read posi…☆12Jun 26, 2013Updated 12 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆37Jul 28, 2023Updated 2 years ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- This repository represents my final assignment of "Module 3 - Android App Development" at Syntax Institut.☆27Jan 17, 2024Updated 2 years ago
- Dockerfile for johnsmith0031/alpaca_lora_4bit☆12Apr 10, 2023Updated 3 years ago
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Mar 27, 2023Updated 3 years ago
- A reverse Geo lookup service written in C, accessible via HTTP and backed by OpenCage and LMDB☆15Aug 20, 2025Updated 9 months ago
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆73Jul 7, 2024Updated last year
- Database kernel notes☆14Aug 18, 2022Updated 3 years ago
- Where we keep our notes about model training runs.☆16Mar 12, 2023Updated 3 years ago
- Simplified installers for oobabooga/text-generation-webui.☆53Sep 16, 2023Updated 2 years ago
- 1-Click is all you need.☆63Apr 29, 2024Updated 2 years ago
- ❄️🐟 Fish completions for Nix☆13Aug 18, 2022Updated 3 years ago
- Resources for independent consultants, freelancers, freiberufler, selbständig in Germany and EMEA☆21Sep 9, 2025Updated 8 months ago
- JavaScript port of the path tracing algorithm from Peter Shirley's "Ray Tracing in One Weekend"☆11Jul 5, 2016Updated 9 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆12Jun 10, 2024Updated last year
- Perceptually uniform colormaps with full range of lightness.☆17Jun 23, 2024Updated last year
- Fast and memory-efficient exact attention - Windows wheels☆36Apr 30, 2025Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Oct 29, 2024Updated last year
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models☆29Jan 19, 2024Updated 2 years ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆28Mar 1, 2023Updated 3 years ago
- Various scripts for working with local LLMs☆16Oct 19, 2023Updated 2 years ago