Fast and memory-efficient exact attention
☆32 · Dec 2, 2024 · Updated last year
Alternatives and similar repositories for flash-attention-3
Users that are interested in flash-attention-3 are comparing it to the libraries listed below.
- Basic world models ☆32 · Oct 30, 2025 · Updated 7 months ago
- SGLang Kernel Wheel Index ☆23 · Updated this week
- CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation ☆51 · Apr 9, 2026 · Updated 2 months ago
- ☆24 · Jun 18, 2024 · Updated last year
- An extension to the GaLore paper that performs Natural Gradient Descent in a low-rank subspace ☆19 · Oct 21, 2024 · Updated last year
- The entire open source TokenRing ecosystem ☆18 · Jun 4, 2026 · Updated last week
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP ☆16 · Apr 17, 2025 · Updated last year
- ☆27 · May 3, 2024 · Updated 2 years ago
- ☆13 · Updated this week
- Implement FlashAttention v2 with minimal code to learn. ☆16 · Jun 12, 2024 · Updated 2 years ago
- ☆13 · Jan 15, 2023 · Updated 3 years ago
- Joint image and Depth inpainting, ldm3d ☆16 · Apr 28, 2024 · Updated 2 years ago
- ☆17 · Apr 9, 2025 · Updated last year
- Teaching materials for improving research software writing abilities. ☆14 · Apr 16, 2026 · Updated last month
- Hardware Division Units ☆10 · Jul 17, 2014 · Updated 11 years ago
- Example of using next.js, nextauth.js and typescript for both anonymous sessions and authenticated sessions ☆10 · Feb 6, 2024 · Updated 2 years ago
- In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results w… ☆16 · Oct 3, 2023 · Updated 2 years ago
- KSimply: An AI Potential Analyzer that recommends open-source models based on user hardware. / Un analizzatore di potenziale AI che consi… ☆20 · Jun 2, 2026 · Updated last week
- A CUDA kernel for NHWC GroupNorm for PyTorch ☆23 · Nov 15, 2024 · Updated last year
- Implementation of a holodeck, written in PyTorch ☆19 · Nov 1, 2023 · Updated 2 years ago
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction. ☆10 · Jan 19, 2025 · Updated last year
- DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning ☆24 · Apr 17, 2021 · Updated 5 years ago
- LLM inference in C/C++ ☆21 · Oct 22, 2025 · Updated 7 months ago
- Monorepo for the Pearl network 🐚 ☆196 · Updated this week
- A Next.js v15+ template with Tailwind v3+, featuring Microsoft Entra ID authentication via Next-Auth v5+ and a Microsoft Graph Client int… ☆10 · Mar 21, 2026 · Updated 2 months ago
- ☆12 · Jun 4, 2024 · Updated 2 years ago
- Triton implementation of Flash Attention 2.0 ☆54 · Jul 31, 2023 · Updated 2 years ago
- Base for the Viral Genomics and Bioinformatics Repository ☆17 · Jul 5, 2024 · Updated last year
- JPEG codec implemented from scratch (Python JPEG codec) ☆10 · Jul 29, 2022 · Updated 3 years ago
- ☆15 · Jun 5, 2023 · Updated 3 years ago
- ☆17 · Oct 20, 2025 · Updated 7 months ago
- (Verilog) A simple convolution layer implementation with systolic array structure ☆13 · May 9, 2022 · Updated 4 years ago
- Basic floating-point components for RISC-V processors ☆12 · Aug 13, 2017 · Updated 8 years ago
- An implementation of run-length encoding for PyTorch tensors using CUDA ☆14 · Apr 7, 2021 · Updated 5 years ago
- Associative scan package for DRYing some code between repos ☆18 · Jan 5, 2026 · Updated 5 months ago
- PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting [CVPPA: ICCVW 2025] ☆34 · Nov 1, 2025 · Updated 7 months ago
- Unsigned radix-2 SRT division ☆16 · May 12, 2015 · Updated 11 years ago
- Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation (CVPR2023) ☆14 · Jul 21, 2023 · Updated 2 years ago
- [IEEE PCS 2022 best paper finalist] "FloLPIPS: A Bespoke Video Quality Metric for Frame Interpolation", Duolikun Danier, Fan Zhang, David … ☆22 · Mar 9, 2024 · Updated 2 years ago