jllllll / GPTQ-for-LLaMa-CUDA
A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for GPTQ-for-LLaMa-CUDA
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated 11 months ago
- An extension to Oobabooga to add a simple memory function for chat☆23Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support☆18Updated last month
- A Qt GUI for large language models☆40Updated 11 months ago
- Prompt Jinja2 templates for LLMs☆27Updated 2 months ago
- ☆27Updated last year
- annoy long term memory experiment for oobabooga/text-generation-webui☆31Updated last year
- Science-driven chatbot development☆55Updated 6 months ago
- Experimental sampler to make LLMs more creative☆30Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- ☆15Updated 8 months ago
- A QT GUI for large language models☆24Updated 10 months ago
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆64Updated 4 months ago
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆107Updated last week
- Loader extension for tabbyAPI in SillyTavern☆21Updated 3 months ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Updated last year
- A simple updated colab doc that will allow you to run the Ooba Booga Text-Generation-Webui for free with just a few lines of codes.☆22Updated last month
- LLM backed Fantasy Tribe Game☆17Updated this week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆28Updated 3 months ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆34Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆73Updated last year
- Fast and memory-efficient exact attention - Windows wheels☆31Updated 8 months ago
- RAG implementation for Ooba characters. dynamically spins up new qdrant vector DB and manages retrieval and commits for conversations ba…☆44Updated last year
- ☆31Updated 10 months ago
- Web page with political compass quiz results for open LLMs☆37Updated 9 months ago
- 8-bit CUDA functions for PyTorch☆22Updated 11 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆27Updated this week
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated 7 months ago
- An Extension for oobabooga/text-generation-webui☆36Updated last year