jllllll / flash-attention
Fast and memory-efficient exact attention - Windows wheels
☆33 Updated last year
Alternatives and similar repositories for flash-attention
Users who are interested in flash-attention are comparing it to the libraries listed below.
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion. ☆55 Updated last year
- A TTS extension for oobabooga text WebUI ☆32 Updated last year
- A web search extension for Oobabooga's text-generation-webui (now with nougat) ☆74 Updated last year
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing … ☆33 Updated last year
- Diffusion_TTS extension for booga ☆67 Updated last year
- Hard Reload oobabooga text WebUI extensions ☆18 Updated 5 months ago
- An extension to Oobabooga to add a simple memory function for chat ☆25 Updated 2 years ago
- SoTA open-source TTS ☆43 Updated 3 weeks ago
- Science-driven chatbot development ☆58 Updated last year
- ☆51 Updated 8 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆54 Updated 10 months ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆155 Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆54 Updated 8 months ago
- Wheels for llama-cpp-python compiled with cuBLAS support ☆25 Updated 3 months ago
- Oobabooga Text-Gen Web UI extension: get web content, add to context ☆21 Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support ☆97 Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect. ☆35 Updated last year
- An unsupervised model merging algorithm for Transformers-based language models. ☆105 Updated last year
- This plugin forces models to output JSON of a specified schema using JSONFormer ☆27 Updated 8 months ago
- Llama cute voice assistant ☆28 Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode ☆141 Updated last month
- Fast and memory-efficient exact attention - Windows wheels ☆38 Updated 2 months ago
- 8-bit CUDA functions for PyTorch ☆25 Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui ☆108 Updated 8 months ago
- Train Llama Loras Easily ☆31 Updated last year
- Training PRO extension for oobabooga WebUI - recent dev version ☆50 Updated 3 weeks ago
- An auto save extension for text generated with the oobabooga WebUI ☆24 Updated last year
- A simple framework for using a local Koboldcpp LLM to help with story-writing ☆21 Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆100 Updated last week
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆33 Updated this week