Fast and memory-efficient exact attention
☆21Jun 26, 2026Updated last week
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆15Oct 13, 2023Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- ☆17May 14, 2024Updated 2 years ago
- ☆62Apr 7, 2026Updated 2 months ago
- ☆11Nov 17, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A curated list of PhD, RA, and Intern openings in Computer Science (CS), Electrical & Computer Engineering (ECE), and Artificial Intellig…☆22Sep 1, 2025Updated 10 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆18Mar 26, 2025Updated last year
- ☆16Feb 5, 2023Updated 3 years ago
- Real time high precision network monitor☆10Feb 24, 2019Updated 7 years ago
- ☆18Apr 8, 2025Updated last year
- A tool to convert ADE20K to COCO format based on desired objects☆15Jun 10, 2024Updated 2 years ago
- Source code of our ICML 2025 paper "Flowing Datasets with Wasserstein over Wasserstein Gradient Flows"☆20May 21, 2025Updated last year
- [AAAI'26] Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression☆19Dec 21, 2025Updated 6 months ago
- Here is the resources and code for the LotteryCodec.☆27Nov 3, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- arok ui☆10Jun 20, 2023Updated 3 years ago
- A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper☆14Mar 12, 2019Updated 7 years ago
- [TMM 2025] Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression☆15Mar 28, 2025Updated last year
- ☆11Apr 5, 2021Updated 5 years ago
- Focused Papers, Delivered Simply :)☆55Dec 25, 2025Updated 6 months ago
- A C++ library for parsing and HLS playlists and demuxing Transport Streams☆11Aug 23, 2022Updated 3 years ago
- 演讲《GPU 驱动的发行版适配》☆11Mar 18, 2024Updated 2 years ago
- Unoffical Pytorch Implementation of Improving Inference for Neural Image Compression☆15Apr 27, 2025Updated last year
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ANFIC: Image Compression Using Augmented Normalizing Flows☆11Dec 31, 2021Updated 4 years ago
- SOTA benchmark☆18Aug 8, 2023Updated 2 years ago
- [NeurIPS 2025 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭] AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆45Mar 28, 2026Updated 3 months ago
- CORE: Automatic Molecule Optimization using Copy & Refine Strategy (AAAI 2020)☆17Jul 17, 2023Updated 2 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆12Jun 12, 2023Updated 3 years ago
- Clustering by fast search and find of density peaks☆13Jan 8, 2016Updated 10 years ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆26Feb 18, 2025Updated last year
- ncnn android robust video matting☆22May 27, 2026Updated last month
- [ICCV 2025 Highlight] official code of paper "DLF: Extreme Image Compression with Dual-generative Latent Fusion"☆47Dec 24, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- nethserver-arm issue tracker☆10Sep 25, 2021Updated 4 years ago
- [TCSVT 2023] RDO-PTQ: Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression☆22Nov 1, 2023Updated 2 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- ☆14May 25, 2022Updated 4 years ago
- [ECCV2022] Gumbel Optimised Loss for Long Tailed Instance Segmentation.☆18Nov 24, 2022Updated 3 years ago
- A boundary detection algorithm in microscopic images considering 3D information.☆12Sep 19, 2018Updated 7 years ago
- 关于无锁队列的知识☆11Feb 13, 2017Updated 9 years ago