jllllll / flash-attention
Fast and memory-efficient exact attention - Windows wheels
☆33Updated last year
Alternatives and similar repositories for flash-attention
Users who are interested in flash-attention compare it to the libraries listed below.
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆57Updated last year
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆73Updated last year
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated 2 years ago
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆22Updated 2 years ago
- An extension to Oobabooga to add a simple memory function for chat☆25Updated 2 years ago
- Diffusion_TTS extension for booga☆68Updated 3 months ago
- Hard Reload oobabooga text WebUI extensions☆19Updated 10 months ago
- Oobabooga Text-Gen Web UI extension: get web content, add to context☆23Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated last year
- This plugin forces models to output JSON of a specified schema using JSONFormer☆28Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆144Updated 4 months ago
- A TTS extension for oobabooga text WebUI☆32Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated 2 years ago
- Science-driven chatbot development☆60Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆155Updated 2 years ago
- Writing Extension for Text Generation WebUI☆64Updated 4 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- Training PRO extension for oobabooga WebUI - recent dev version☆52Updated 4 months ago
- A repository to store helpful information and emerging insights in regard to LLMs☆21Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- Attend - to what matters.☆17Updated 9 months ago
- Wheels for llama-cpp-python compiled with cuBLAS support☆27Updated 8 months ago
- Creates a LangChain agent that uses the WebUI's API and Wikipedia to work☆74Updated 2 years ago
- ☆13Updated 2 years ago
- Fast and memory-efficient exact attention - Windows wheels☆36Updated 7 months ago
- LLM backed Fantasy Tribe Game☆19Updated last year
- An auto save extension for text generated with the oobabooga WebUI☆25Updated 2 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆41Updated last year
- Loader extension for tabbyAPI in SillyTavern☆26Updated 5 months ago