jllllll / flash-attentionLinks
Fast and memory-efficient exact attention - Windows wheels
☆33Updated last year
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below
Sorting:
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆72Updated last year
- Diffusion_TTS extension for booga☆69Updated 5 months ago
- A TTS extension for oobabooga text WebUI☆32Updated last year
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated 2 years ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆58Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆25Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆36Updated 2 years ago
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆23Updated 2 years ago
- Science-driven chatbot development☆61Updated last year
- Hard Reload oobabooga text WebUI extensions☆19Updated last year
- Fast and memory-efficient exact attention - Windows wheels