Fast and memory-efficient exact attention
☆20Jul 22, 2024Updated last year
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple implementation of a deep linear Pytorch module☆21Oct 16, 2020Updated 5 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 3 years ago
- Graph neural network message passing reframed as a Transformer with local attention☆70Dec 24, 2022Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆54Mar 30, 2021Updated 5 years ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 7 months ago
- ☆15Jan 27, 2025Updated last year
- ☆13Apr 16, 2022Updated 3 years ago
- Deformable Convolutional Networks v2 with Pytorch☆10Jul 29, 2020Updated 5 years ago
- Block-Recurrent Dynamics in ViTs 🦖☆34Dec 24, 2025Updated 3 months ago
- [CVPR 2026 Fingdings] This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑La…☆28Mar 15, 2026Updated 3 weeks ago
- 《多模态大模型部署微调指南》快速部署/微调多模态大模型☆12Dec 4, 2024Updated last year
- FAQ for University of CaliforniaSanta Cruz 2019 Incoming Grads☆11Apr 4, 2019Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10May 24, 2020Updated 5 years ago
- ChEMBL Database used to create Lipinski Descriptors (ADME Pharmokinetic Profile) to use in a Random Forest Regression Model☆15Nov 8, 2020Updated 5 years ago
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction.☆11Jan 19, 2025Updated last year
- Pixie pipeline described in Liu et al., Robust phenotyping of highly multiplexed tissue imaging data using pixel-level clustering☆12Aug 3, 2023Updated 2 years ago
- Pytorch implementation of "Very Deep Graph Neural Networks via Noise Regularisation"☆10Aug 22, 2021Updated 4 years ago
- Code for H. Narasimhan, "Learning with Complex Loss Functions and Constraints", AISTATS 2018☆11Mar 21, 2018Updated 8 years ago
- ☆10Jun 4, 2024Updated last year
- Implementation of "Denoise Pretraining on Non-equilibrium Molecular Conformations for Accurate and Transferable Neural Potentials" in PyT…☆14Jul 26, 2023Updated 2 years ago
- ☆13Dec 1, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The package used to win the ARIEL challenge 2023 by the AstroAI team @ CfA | Harvard & Smithsonian☆11Jul 31, 2023Updated 2 years ago
- An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more☆12Jan 14, 2026Updated 2 months ago
- ☆16Oct 20, 2025Updated 5 months ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- Neptune - TensorBoard integration 🧩 Experiment tracking with advanced UI, collaborative features, and user access management.☆13Sep 4, 2025Updated 7 months ago
- ☆10Jan 29, 2021Updated 5 years ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 11 months ago
- Experimental RISC-V assembler code snippets☆10Oct 23, 2019Updated 6 years ago
- AlphaGeometry2 symbolic engine (DDAR) with examples☆61Jan 7, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Dec 12, 2023Updated 2 years ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 4 years ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57May 17, 2024Updated last year
- Neural Distributed Image Compression using Cross-Attention Feature Alignment (NDIC-CAM) [WACV 2023]☆12Jul 19, 2022Updated 3 years ago
- 🎓 The chrome extension to make learning from YouTube faster & easier.☆11Jan 9, 2022Updated 4 years ago
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- The implement of geometric solver PGPSNet☆30Jan 30, 2025Updated last year