Fast and memory-efficient exact attention
☆20Jul 22, 2024Updated last year
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 7 months ago
- ☆15Jan 27, 2025Updated last year
- 汽车-androidAPP-物联网-蓝牙☆11Nov 29, 2017Updated 8 years ago
- Block-Recurrent Dynamics in ViTs 🦖☆36Dec 24, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Deformable Convolutional Networks v2 with Pytorch☆10Jul 29, 2020Updated 5 years ago
- [CVPR 2026 Fingdings] This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑La…☆28Mar 15, 2026Updated last month
- Hardware Division Units☆10Jul 17, 2014Updated 11 years ago
- RADIX-4 SRT division☆12Oct 31, 2019Updated 6 years ago
- FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving☆11Jan 22, 2024Updated 2 years ago
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction.☆11Jan 19, 2025Updated last year
- A Pytorch implementation of Global Self-Attention Network, a fully-attention backbone for vision tasks☆94Nov 21, 2020Updated 5 years ago
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)☆10Mar 9, 2024Updated 2 years ago
- Code for H. Narasimhan, "Learning with Complex Loss Functions and Constraints", AISTATS 2018☆11Mar 21, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Learning Transferable Features with Deep Adaptation Networks☆13Jul 18, 2023Updated 2 years ago
- ☆11Mar 28, 2021Updated 5 years ago
- ☆11Jun 4, 2024Updated last year
- Estimating hardware and cloud costs of LLMs and transformer projects☆21Apr 1, 2026Updated 3 weeks ago
- An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more☆12Jan 14, 2026Updated 3 months ago
- Implementation of "Denoise Pretraining on Non-equilibrium Molecular Conformations for Accurate and Transferable Neural Potentials" in PyT…☆14Jul 26, 2023Updated 2 years ago
- ☆14Apr 9, 2026Updated 2 weeks ago
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- The package used to win the ARIEL challenge 2023 by the AstroAI team @ CfA | Harvard & Smithsonian☆10Jul 31, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- A simple cross attention that updates both the source and target in one step☆195Jul 29, 2025Updated 9 months ago
- NoC based MPSoC☆11Jul 17, 2014Updated 11 years ago
- Adaptive floating-point based numerical format for resilient deep learning☆14Apr 11, 2022Updated 4 years ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆13May 9, 2022Updated 3 years ago
- Neptune - TensorBoard integration 🧩 Experiment tracking with advanced UI, collaborative features, and user access management.☆13Sep 4, 2025Updated 7 months ago
- Release code for light-weight calibrator: a separable component for unsupervised domain adaptation☆13Jul 17, 2021Updated 4 years ago
- Associative scan package for DRYing some code between repos☆18Jan 5, 2026Updated 3 months ago
- Showing full TensorBoard support in Tensorflow for a CNN using MNIST data.☆13Oct 19, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Text to Speech with PyTorch (English and Mongolian)☆13May 3, 2020Updated 5 years ago
- unsigned Radix-2 SRT division,基2除法☆16May 12, 2015Updated 10 years ago
- ☆12Jun 12, 2017Updated 8 years ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated last year
- Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation (CVPR2023)☆14Jul 21, 2023Updated 2 years ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57May 17, 2024Updated last year