jllllll / GPTQ-for-LLaMa-CUDA
A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for GPTQ-for-LLaMa-CUDA
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support☆18Updated last month
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆107Updated 3 weeks ago
- ☆27Updated last year
- Fast and memory-efficient exact attention - Windows wheels☆31Updated 8 months ago
- Science-driven chatbot development☆55Updated 6 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated 8 months ago
- A Qt GUI for large language models☆40Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆34Updated last year
- annoy long term memory experiment for oobabooga/text-generation-webui☆31Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆23Updated last year
- LLM backed Fantasy Tribe Game☆18Updated this week
- Experimental sampler to make LLMs more creative☆30Updated last year
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Updated last year
- Prompt Jinja2 templates for LLMs☆27Updated 2 months ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆64Updated 4 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- Porting BabyAGI to Oobabooba.☆33Updated last year
- Loader extension for tabbyAPI in SillyTavern☆22Updated 3 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆42Updated 8 months ago
- Port of Facebook's LLaMA model in C/C++☆15Updated this week
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆28Updated 2 weeks ago
- Nexusflow function call, tool use, and agent benchmarks.☆14Updated this week
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- A repository to store helpful information and emerging insights in regard to LLMs☆20Updated last year
- Large-Language-Model to Machine Interface project.☆17Updated 11 months ago
- ☆31Updated 10 months ago