A fork of textgen that kept some things like Exllama and old GPTQ.
☆22Aug 20, 2024Updated last year
Alternatives and similar repositories for text-generation-webui-testing
Users that are interested in text-generation-webui-testing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32May 25, 2023Updated 2 years ago
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Mar 27, 2023Updated 3 years ago
- ☆536Dec 1, 2023Updated 2 years ago
- Collection of various text datasets to assist ML researchers in training or fine-tuning their models☆21Apr 1, 2023Updated 3 years ago
- A guidance compatibility layer for llama-cpp-python☆36Sep 11, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multichannel Looper/Feedback System for Riffusion☆14May 6, 2023Updated 3 years ago
- OnePlus 8T Param Read/Write☆14Dec 4, 2020Updated 5 years ago
- Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models☆26Sep 14, 2025Updated 7 months ago
- ☆15Apr 11, 2023Updated 3 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- A Simple webserver for generating text with exllamav2☆14Dec 18, 2023Updated 2 years ago
- LangChain + llamaCPP + babyAGI implementation☆13Apr 12, 2023Updated 3 years ago
- Port of Facebook's LLaMA model in C/C++☆16Jul 3, 2023Updated 2 years ago
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆41Aug 4, 2023Updated 2 years ago
- Steering LLM Thinking with Budget Guidance☆30Feb 19, 2026Updated 2 months ago
- CNC hand-held pendant for OpenBuilds CONTROL☆12Aug 12, 2025Updated 8 months ago
- LLM Quantization toolkit☆20May 2, 2026Updated last week
- CI scripts designed to build a Pascal-compatible version of vLLM.☆12Aug 10, 2024Updated last year
- A curated list of my GitHub stars!☆23Updated this week
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input a target size and the toolchain w…☆124Updated this week
- Stalker Checker MAC Generator Portal IPTV FREE☆17Nov 18, 2024Updated last year
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Mar 18, 2026Updated last month
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆37Jul 28, 2023Updated 2 years ago
- Hodge podge random stuff☆10Jan 20, 2017Updated 9 years ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆79Dec 17, 2024Updated last year
- Network for procedural editing of text with LLMs☆23Apr 28, 2026Updated last week
- MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models, sloppily ported to cog/replicate☆12Apr 25, 2023Updated 3 years ago
- (⚠️DO NOT FORK⚠️) Integrate Magisk root and Google Apps (and more) into WSA☆11Jan 28, 2023Updated 3 years ago
- Extract moc3 live2d model from .lpk file with GUI☆18Aug 9, 2023Updated 2 years ago
- ☆10Mar 14, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ZURB's Foundation framework, LESS powered☆31Jan 21, 2012Updated 14 years ago
- Screen space global illumination for interactive mixed reality☆14Dec 13, 2017Updated 8 years ago
- 🎨 Digital line art inspired by Georg Nees.☆17Dec 21, 2018Updated 7 years ago
- A TypeScript example showcasing the integration of Ollama with the Model Context Protocol (MCP) servers. This project provides an interac…☆29Aug 21, 2025Updated 8 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated 2 years ago
- Quantized inference code for LLaMA models☆13Mar 12, 2023Updated 3 years ago
- This application serves as a demonstration of the integration of langchain.js, Ollama, and ChromaDB to showcase question-answering capabi…☆27Feb 11, 2024Updated 2 years ago