Fast and memory-efficient exact attention - Windows wheels
☆33 · Mar 3, 2024 · Updated 2 years ago
Alternatives and similar repositories for flash-attention
Users who are interested in flash-attention are comparing it to the libraries listed below.
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format. ☆23 · Oct 6, 2023 · Updated 2 years ago
- ☆13 · Oct 30, 2023 · Updated 2 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative ☆23 · Aug 2, 2023 · Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect. ☆36 · Jul 28, 2023 · Updated 2 years ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion. ☆58 · May 18, 2024 · Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support ☆27 · Apr 9, 2025 · Updated 10 months ago
- Fast and memory-efficient exact attention - Windows wheels ☆36 · Apr 30, 2025 · Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 · Mar 21, 2024 · Updated last year
- An extension to Oobabooga to add a simple memory function for chat ☆25 · Jun 5, 2023 · Updated 2 years ago
- Simple Python script for generating W++ character descriptions via Poe's GPT ☆10 · Jan 12, 2026 · Updated last month
- A web search extension for Oobabooga's text-generation-webui (now with nougat) ☆72 · Jul 7, 2024 · Updated last year
- LLM Agent that performs sentiment analysis of drawings and natural language using a combination of Google Gemini Vision model and GPT-4 T… ☆13 · Dec 22, 2023 · Updated 2 years ago
- Precompiled Wheels for GPTQ-for-LLaMa ☆19 · Jul 26, 2023 · Updated 2 years ago
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing … ☆32 · Nov 20, 2023 · Updated 2 years ago
- Science-driven chatbot development ☆62 · May 5, 2024 · Updated last year
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model. ☆17 · Apr 22, 2024 · Updated last year
- Official content repository. ☆29 · Feb 24, 2026 · Updated last week
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆57 · Oct 22, 2024 · Updated last year
- An auto save extension for text generated with the oobabooga WebUI ☆25 · Oct 6, 2025 · Updated 4 months ago
- Wheels for llama-cpp-python compiled with cuBLAS support ☆102 · Feb 1, 2024 · Updated 2 years ago
- 8-bit CUDA functions for PyTorch ☆26 · Nov 18, 2023 · Updated 2 years ago
- Oobabooga Text-Gen Web UI extension: get web content, add to context ☆23 · Jun 1, 2024 · Updated last year
- A unified library for interacting with various AI APIs through a standardized interface. ☆31 · Mar 13, 2025 · Updated 11 months ago
- This plugin forces models to output JSON of a specified schema using JSONFormer ☆28 · Nov 16, 2024 · Updated last year
- Polyglot is a fast, elegant, and free translation tool using AI. ☆64 · Nov 21, 2025 · Updated 3 months ago
- OpenVPN for Windows, with support for the Tunnelblick obfuscation patch ☆31 · Dec 6, 2022 · Updated 3 years ago
- Web page with political compass quiz results for open LLMs ☆38 · Jan 31, 2024 · Updated 2 years ago
- A set of TTS nodes for ComfyUI ☆30 · Jun 14, 2024 · Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker. ☆36 · Jul 2, 2025 · Updated 8 months ago
- A Streamlit app that transcribes audio using LLMware models, analyzes the text for insights, and includes an interactive Dragon model cha… ☆13 · May 19, 2024 · Updated last year
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models. ☆33 · Jul 14, 2023 · Updated 2 years ago
- ☆13 · Apr 25, 2019 · Updated 6 years ago
- Web UI for ExLlamaV2 ☆512 · Feb 5, 2025 · Updated last year
- A TTS extension for oobabooga text WebUI ☆31 · May 5, 2024 · Updated last year
- ALICE and its prior work, Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality ☆42 · Sep 4, 2024 · Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆32 · May 25, 2023 · Updated 2 years ago
- This repository represents my final assignment of "Module 3 - Android App Development" at Syntax Institut. ☆27 · Jan 17, 2024 · Updated 2 years ago
- Fuzzy Logic Library for Microsoft .Net ☆10 · Jan 10, 2016 · Updated 10 years ago
- PyTorch implementation for "Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation" ☆10 · Apr 11, 2024 · Updated last year