Flash attention implementation Minimal CUDA implementation of Flash Attention with tiled computation and online softmax. Educational implementation based on Dao et al., 2022.
☆20Dec 27, 2025Updated 3 months ago
Alternatives and similar repositories for flash-attention-cuda
Users that are interested in flash-attention-cuda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project showcases a comprehensive analysis of CO2 emissions in a fictitious cheese manufacturing supply chain using both graph datab…☆11Sep 18, 2024Updated last year
- Material for the Design and Analysis of Algorithms course taught at Princess Sumaya University for Technology☆60Updated this week
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated 11 months ago
- ☆11Nov 10, 2025Updated 4 months ago
- Launch and configuration files for running Nav2 on MVsim worlds☆19Jun 9, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Deploying Spark machine learning models to Azure☆15Mar 28, 2023Updated 3 years ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆25Aug 5, 2025Updated 7 months ago
- EE6427 Video Signal Processing☆16Jan 14, 2021Updated 5 years ago
- ☆24Nov 22, 2019Updated 6 years ago
- A ROS Action server that handles communication with move base action server to achieve a list of required goal poses successively.☆19Sep 19, 2021Updated 4 years ago
- ☆13Aug 9, 2023Updated 2 years ago
- ☆27Sep 11, 2025Updated 6 months ago
- ☆14Oct 6, 2024Updated last year
- Haskell bindings to Halide☆20Mar 18, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆24Dec 12, 2024Updated last year
- A tiny scalar-valued autograd engine and neural network library (Karpathy course)☆12Mar 16, 2026Updated last week
- Won 2nd prize in HackUIET hackathon and best project in AI theme☆20Jan 1, 2025Updated last year
- ☆32Oct 31, 2025Updated 4 months ago
- A search index specialised for LaTeX equations. Developed for latexsearch.com.☆17Jul 15, 2011Updated 14 years ago
- High performance implementation of Deep neuroevolution in pytorch using mpi4py. Intended for use on HPC clusters☆27Jan 24, 2022Updated 4 years ago
- Run RF-DETR on NVIDIA DeepStream☆26Jan 13, 2026Updated 2 months ago
- Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.☆63Dec 19, 2025Updated 3 months ago
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Automation in iOS, iPadOS and macOS☆22Jan 2, 2021Updated 5 years ago
- Personal Workspace setup☆24Sep 13, 2024Updated last year
- EXTD :: Extremely Tiny Face Detector via Iterative Filter Reuse☆13Jul 25, 2019Updated 6 years ago
- 🦄 Serving Platform for Spatial AI and Robotics.☆23Jun 19, 2025Updated 9 months ago
- Codebase for "WhAM: Towards A Translative Model of Sperm Whale Vocalization" (NeurIPS 2025)☆40Mar 10, 2026Updated 2 weeks ago
- ☆12Aug 6, 2022Updated 3 years ago
- CustomLLM config to leverage watsonx LLMs with continue.dev.☆17Aug 27, 2024Updated last year
- Official Codebase for "Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers"☆25Jun 7, 2025Updated 9 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Apr 19, 2024Updated last year
- ☆12Sep 21, 2023Updated 2 years ago
- Comfyui Node Pack☆31Sep 17, 2025Updated 6 months ago
- FLOPs and other statistics COunter for Pytorch neural networks☆23May 27, 2021Updated 4 years ago
- [CVPR'23 Highlight] Heterogeneous Continual Learning.☆15Dec 5, 2023Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Feb 15, 2023Updated 3 years ago
- ☆12Feb 23, 2023Updated 3 years ago