Fast and memory-efficient exact attention
☆21Jun 3, 2026Updated last week
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆15Oct 13, 2023Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- ☆17May 14, 2024Updated 2 years ago
- ☆57Apr 7, 2026Updated 2 months ago
- ☆11Nov 17, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of PhD, RA, and Intern openings in Computer Science (CS), Electrical & Computer Engineering (ECE), and Artificial Intellig…☆21Sep 1, 2025Updated 9 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆18Mar 26, 2025Updated last year
- ☆17Feb 5, 2023Updated 3 years ago
- Real time high precision network monitor☆10Feb 24, 2019Updated 7 years ago
- ☆18Apr 8, 2025Updated last year
- A tool to convert ADE20K to COCO format based on desired objects☆15Jun 10, 2024Updated 2 years ago
- Source code of our ICML 2025 paper "Flowing Datasets with Wasserstein over Wasserstein Gradient Flows"☆20May 21, 2025Updated last year
- [AAAI'26] Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression☆19Dec 21, 2025Updated 5 months ago
- Here is the resources and code for the LotteryCodec.☆27Nov 3, 2025Updated 7 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- arok ui☆10Jun 20, 2023Updated 2 years ago
- A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper☆13Mar 12, 2019Updated 7 years ago
- [TMM 2025] Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression☆15Mar 28, 2025Updated last year
- ☆11Apr 5, 2021Updated 5 years ago
- Focused Papers, Delivered Simply :)☆55Dec 25, 2025Updated 5 months ago
- A C++ library for parsing and HLS playlists and demuxing Transport Streams☆11Aug 23, 2022Updated 3 years ago
- 演讲《GPU 驱动的发行版适配》☆11Mar 18, 2024Updated 2 years ago
- Unoffical Pytorch Implementation of Improving Inference for Neural Image Compression☆15Apr 27, 2025Updated last year
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ANFIC: Image Compression Using Augmented Normalizing Flows☆11Dec 31, 2021Updated 4 years ago
- SOTA benchmark☆18Aug 8, 2023Updated 2 years ago
- [NeurIPS 2025 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭] AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆45Mar 28, 2026Updated 2 months ago
- CORE: Automatic Molecule Optimization using Copy & Refine Strategy (AAAI 2020)☆17Jul 17, 2023Updated 2 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆12Jun 12, 2023Updated 3 years ago
- Clustering by fast search and find of density peaks☆13Jan 8, 2016Updated 10 years ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆26Feb 18, 2025Updated last year
- ncnn android robust video matting☆22May 27, 2026Updated 2 weeks ago
- [ICCV 2025 Highlight] official code of paper "DLF: Extreme Image Compression with Dual-generative Latent Fusion"☆45Dec 24, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- nethserver-arm issue tracker☆10Sep 25, 2021Updated 4 years ago
- [TCSVT 2023] RDO-PTQ: Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression☆19Nov 1, 2023Updated 2 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- ☆14May 25, 2022Updated 4 years ago
- [ECCV2022] Gumbel Optimised Loss for Long Tailed Instance Segmentation.☆18Nov 24, 2022Updated 3 years ago
- A boundary detection algorithm in microscopic images considering 3D information.☆12Sep 19, 2018Updated 7 years ago
- 关于无锁队列的知识☆11Feb 13, 2017Updated 9 years ago