oobabooga / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆17Updated last month
Related projects: ⓘ
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆21Updated 11 months ago
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated 10 months ago
- A Qt GUI for large language models☆40Updated 10 months ago
- Loader extension for tabbyAPI in SillyTavern☆18Updated last month
- Fast and memory-efficient exact attention - Windows wheels☆27Updated 6 months ago
- An extension to Oobabooga to add a simple memory function for chat☆23Updated last year
- ☆44Updated this week
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆62Updated 2 months ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆32Updated last year
- RAG implementation for Ooba characters. dynamically spins up new qdrant vector DB and manages retrieval and commits for conversations ba…☆45Updated 11 months ago
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆105Updated 4 months ago
- 8-bit CUDA functions for PyTorch☆22Updated 10 months ago
- Diffusion_TTS extension for booga☆59Updated 2 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated 10 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆34Updated last year
- A TTS extension for oobabooga text WebUI☆26Updated 4 months ago
- Science-driven chatbot development☆54Updated 4 months ago
- A custom extension for AUTOMATIC1111/stable-diffusion-webui to extend rest APIs to do some local operations, using in StableStudio.☆43Updated last year
- Experimental sampler to make LLMs more creative☆29Updated last year
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Updated last year
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆21Updated 11 months ago
- ☆17Updated 9 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆96Updated 4 months ago
- Port of Facebook's LLaMA model in C/C++☆15Updated last week
- annoy long term memory experiment for oobabooga/text-generation-webui☆31Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆39Updated 2 months ago
- 8-bit CUDA functions for PyTorch☆45Updated last year
- ☆26Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated 6 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated 3 months ago