Fast and memory-efficient exact attention
☆20Jul 22, 2024Updated last year
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 3 years ago
- Graph neural network message passing reframed as a Transformer with local attention☆70Dec 24, 2022Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆54Mar 30, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆15Jan 27, 2025Updated last year
- ☆13Apr 16, 2022Updated 4 years ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Hardware Division Units☆10Jul 17, 2014Updated 11 years ago
- FAQ for University of CaliforniaSanta Cruz 2019 Incoming Grads☆12Apr 4, 2019Updated 7 years ago
- ChEMBL Database used to create Lipinski Descriptors (ADME Pharmokinetic Profile) to use in a Random Forest Regression Model☆15Nov 8, 2020Updated 5 years ago
- Radam+lookahead implemented by tensorflow☆11Oct 14, 2019Updated 6 years ago
- A Pytorch implementation of Global Self-Attention Network, a fully-attention backbone for vision tasks☆94Nov 21, 2020Updated 5 years ago
- ☆12Jun 12, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch implementation of "Very Deep Graph Neural Networks via Noise Regularisation"☆10Aug 22, 2021Updated 4 years ago
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)☆10Mar 9, 2024Updated 2 years ago
- Code for H. Narasimhan, "Learning with Complex Loss Functions and Constraints", AISTATS 2018☆11Mar 21, 2018Updated 8 years ago
- ☆12Jun 4, 2024Updated 2 years ago
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- ☆15Apr 9, 2026Updated 2 months ago
- The package used to win the ARIEL challenge 2023 by the AstroAI team @ CfA | Harvard & Smithsonian☆10Jul 31, 2023Updated 2 years ago
- tensorflow implementation for scoring blur image sharpness☆12Nov 29, 2017Updated 8 years ago
- ☆17Oct 20, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- NoC based MPSoC☆11Jul 17, 2014Updated 11 years ago
- A PyTorch implementation of [VCT](https://github.com/google-research/google-research/tree/master/vct)☆10Nov 25, 2022Updated 3 years ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆13May 9, 2022Updated 4 years ago
- Basic floating-point components for RISC-V processors☆12Aug 13, 2017Updated 8 years ago
- A implement of run-length encoding for Pytorch tensor using CUDA☆14Apr 7, 2021Updated 5 years ago
- Associative scan package for DRYing some code between repos☆18Jan 5, 2026Updated 5 months ago
- Showing full TensorBoard support in Tensorflow for a CNN using MNIST data.☆13Oct 19, 2019Updated 6 years ago
- Text to Speech with PyTorch (English and Mongolian)☆13May 3, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- unsigned Radix-2 SRT division,基2除法☆16May 12, 2015Updated 11 years ago
- ☆10Jan 29, 2021Updated 5 years ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated last year
- Experimental RISC-V assembler code snippets☆10Oct 23, 2019Updated 6 years ago
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 5 months ago
- Official implementation of "MadCLIP: Few-shot Medical Anomaly Detection with CLIP" (MICCAI 2025, Early Accepted).☆28Jul 24, 2025Updated 10 months ago