Fast and memory-efficient exact attention - Windows wheels
☆36Apr 30, 2025Updated last year
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Frontend for Power Users.☆12Feb 7, 2024Updated 2 years ago
- Fast and memory-efficient exact attention☆939Dec 9, 2025Updated 6 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆31Jul 19, 2025Updated 11 months ago
- ☆12Jun 13, 2023Updated 3 years ago
- Helping build fair, safe, ethical, and RIGHT General Artificial Intelligence and helping to introduce humanity to (RG)AI through the magi…☆11Aug 6, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆21Jun 22, 2026Updated last week
- The installation of the InsightFace package on a Windows environment, including the necessary dependencies and configurations.☆18Sep 10, 2024Updated last year
- ☆10Feb 11, 2025Updated last year
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆23Oct 6, 2023Updated 2 years ago
- ☆13Oct 22, 2023Updated 2 years ago
- Stable Diffusion web UI☆11Sep 13, 2022Updated 3 years ago
- Fork of the Triton language and compiler for Windows support and easy installation☆1,944Feb 18, 2026Updated 4 months ago
- Sample based concatenative synthesizer for the NSynth dataset. Render any MIDI (.mid) sequence with the notes of NSynth.☆12Oct 4, 2023Updated 2 years ago
- ☆25Feb 21, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ComfyUI nodes to edit videos using Genmo Mochi☆24Nov 3, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- LLM Agent that performs sentiment analysis of drawings and natural language using a combination of Google Gemini Vision model and GPT-4 T…☆13Dec 22, 2023Updated 2 years ago
- [DEPRECEATED] Morpheus Music AI implementation spin-off :)☆16Oct 5, 2022Updated 3 years ago
- A native Application , With Agentic Support (MCP) for ultra fast AI image generation using a highly optimized Z-Image-Turbo model with SD…☆32Feb 3, 2026Updated 4 months ago
- Mining graph streams using dictionary-based compression☆16Aug 10, 2017Updated 8 years ago
- Add Epic Games' SDK to your Ren'Py games☆16Mar 1, 2026Updated 3 months ago
- ☆12Feb 6, 2024Updated 2 years ago
- Blender Icicle Generator for Blender 2.80☆15Nov 17, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- BAT file to quick installation git-versions "ComfyUI" & "SUPIR" v.2 (NVIDIA)☆10Apr 16, 2024Updated 2 years ago
- ☆18Sep 4, 2024Updated last year
- Hard Reload oobabooga text WebUI extensions☆20Jan 23, 2025Updated last year
- Source code for SuperAGI's Zapier Integration☆13Aug 20, 2023Updated 2 years ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆19Jan 10, 2025Updated last year
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- CLIP Interrogator, fully in HuggingFace Transformers 🤗, with LongCLIP & CLIP's own words and / or *your* own words!☆19Jul 18, 2025Updated 11 months ago
- ☆16Aug 1, 2024Updated last year
- ☆17Dec 28, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Aug 2, 2023Updated 2 years ago
- ☆17Jan 10, 2024Updated 2 years ago
- AI Reddit bot that scrapes subreddits for questions, conducts research, and posts automated answers to help users with relevant informati…☆19Sep 13, 2024Updated last year
- Official webpage of the Programming Group on https://programming-group.com☆12Jun 18, 2026Updated last week
- A prototype of a lazily made text tool alternative for Krita (Use at your own risk)☆17May 31, 2024Updated 2 years ago
- ☆29Dec 4, 2023Updated 2 years ago
- ☆21Dec 18, 2023Updated 2 years ago