prasannakotyal/flash-attention-cuda

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/prasannakotyal/flash-attention-cuda)

prasannakotyal / flash-attention-cuda

Flash attention implementation Minimal CUDA implementation of Flash Attention with tiled computation and online softmax. Educational implementation based on Dao et al., 2022.

☆20

Alternatives and similar repositories for flash-attention-cuda

Users that are interested in flash-attention-cuda are comparing it to the libraries listed below

Sorting:

ialbluwi / psut-algorithms
View on GitHub
Material for the Design and Analysis of Algorithms course taught at Princess Sumaya University for Technology
☆58May 25, 2025Updated 9 months ago
jiasin88 / multi-step-ai-agent
View on GitHub
Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…
☆10Dec 19, 2024Updated last year
The-AI-Alliance / semiont
View on GitHub
AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.
☆29Updated this week
gerdm / martingale-posterior-neural-networks
View on GitHub
Martingale posterior neural networks for fast sequential decision making @ Neurips 2025
☆23Nov 13, 2025Updated 3 months ago
SAGE-3 / next
View on GitHub
Software to enable data-rich collaboration from high-resolution display walls to your laptop
☆16Updated this week
watson-developer-cloud / watsonx-orchestrate-developer-toolkit
View on GitHub
☆11Nov 10, 2025Updated 3 months ago
rajiviyer / genai_evaluation
View on GitHub
☆12Sep 21, 2023Updated 2 years ago
zhangzef / COOPER
View on GitHub
The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.
☆28Dec 30, 2025Updated 2 months ago
ranfysvalle02 / hacking-vectors
View on GitHub
Python script demonstrating the process of recovering text from embeddings, highlighting the associated privacy risks and mitigation stra…
☆19Nov 19, 2024Updated last year
Bob-lance / grok-mcp
View on GitHub
MCP server for Grok AI API integration
☆22Jun 2, 2025Updated 9 months ago
go-authgate / authgate
View on GitHub
A lightweight OAuth 2.0 Authorization Server supporting Device Authorization Grant (RFC 8628) and Authorization Code Flow with PKCE (RFC …
☆32Updated this week
iSEE-Laboratory / Long_RVOS
View on GitHub
(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆27Feb 28, 2026Updated last week
mercator-ocean / oceanbench
View on GitHub
Benchmark evaluating ocean forecasting systems against reference datasets and observations.
☆26Updated this week
Embodied-Reasoning-Agent / Embodied-Reasoning-Agent
View on GitHub
☆32Feb 3, 2026Updated last month
ezelikman / anonymal
View on GitHub
Fast, free, easy, and object-agnostic video anonymization
☆11Dec 12, 2020Updated 5 years ago
ZuyiZhou / Awesome-Cross-modal-Reasoning-with-LLMs
View on GitHub
☆13Oct 21, 2024Updated last year
formare / auctions
View on GitHub
Auction Theory Toolbox – Computer Verified Auctions
☆14Jul 12, 2016Updated 9 years ago
sdsc-ordes / kg-llm-interface
View on GitHub
Langchain-powered natural language interface to knowledge-graphs.
☆17Nov 3, 2025Updated 4 months ago
zjowowen / GenerativeRL_Preview
View on GitHub
Python library for solving reinforcement learning (RL) problems using generative models.
☆11Feb 18, 2025Updated last year
jiajingyyyyyy / AutoTool
View on GitHub
[AAAI 2026] AutoTool: Efficient Tool Selection for Large Language Model Agents
☆29Dec 28, 2025Updated 2 months ago
montefiore-sail / appa
View on GitHub
Code for the publication "Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation".
☆24Dec 4, 2025Updated 3 months ago
protocol-security / fuzztools
View on GitHub
Struct-aware fuzzing framework + some fuzzers
☆30Jan 28, 2026Updated last month
stefanhaustein / tantilla
View on GitHub
Mobile IDE
☆12Nov 9, 2020Updated 5 years ago
a-antoniades / swe-search
View on GitHub
☆12Nov 5, 2024Updated last year
WeChatCV / UnicBench
View on GitHub
UnicEdit-10M and UnicBench project
☆23Mar 3, 2026Updated last week
modaic-ai / gepa-rpc
View on GitHub
Run GEPA on your favorite non-python libraries.
☆33Jan 22, 2026Updated last month
sonarme / luke
View on GitHub
DEPRECATED, since we cannot maintain this Luke repo any longer. Please fork / Luke fork for Lucene 4.3 (mavenized)
☆16May 12, 2021Updated 4 years ago
CurvSurf / FindSurface-RealityKit-visionOS
View on GitHub
A sample project for visionOS that showcases FindSurface's functionalities.
☆13Dec 18, 2025Updated 2 months ago
TheRobotStudio / V2_DexHand
View on GitHub
The stl files and code for the V2 DexHand
☆51May 26, 2025Updated 9 months ago
syuya2036 / ralph-loop
View on GitHub
This repository implements the "Ralph" autonomous coding loop pattern, designed to be agnostic of the specific AI agent being used. Wheth…
☆31Jan 7, 2026Updated 2 months ago
apple / ml-sid-dit
View on GitHub
☆41Oct 29, 2025Updated 4 months ago
projnanda / NEST
View on GitHub
☆25Dec 19, 2025Updated 2 months ago
FantasyFish / AI-Rabbit-R1
View on GitHub
The open-source language model computer
☆10Mar 22, 2024Updated last year
kmkrofficial / LiteGPT
View on GitHub
LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.
☆34Dec 16, 2025Updated 2 months ago
MattTuttle / Flume2D
View on GitHub
A game engine made in Java using libgdx (Currently in alpha state, and probably will remain that way)
☆16Jan 4, 2012Updated 14 years ago
gavinaboulhosn / SwiftMCP
View on GitHub
Swift Implementation of the Model Context Protocol (MCP) Spec
☆10Mar 28, 2025Updated 11 months ago
NightTrek / Supabase-MCP
View on GitHub
A model context protocol implementation granting LLMs access to make database queries and learn about supabase types.
☆14Dec 13, 2024Updated last year
Exadra37 / ai-intent-driven-development
View on GitHub
AI Intent Driven Development (IDD) guidelines and instructions for AI Coding Agents, AI Coding Assistants, and LLMs.
☆29Jan 27, 2026Updated last month
yangdongchao / Omni-AutoThink
View on GitHub
Adaptive Multimodal Reasoning via Reinforcement Learning
☆23Jan 11, 2026Updated last month