Flash attention implementation Minimal CUDA implementation of Flash Attention with tiled computation and online softmax. Educational implementation based on Dao et al., 2022.
☆20Dec 27, 2025Updated 2 months ago
Alternatives and similar repositories for flash-attention-cuda
Users that are interested in flash-attention-cuda are comparing it to the libraries listed below
Sorting:
- Material for the Design and Analysis of Algorithms course taught at Princess Sumaya University for Technology☆58May 25, 2025Updated 9 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Updated this week
- ☆11Nov 10, 2025Updated 3 months ago
- ☆12Sep 21, 2023Updated 2 years ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- Python script demonstrating the process of recovering text from embeddings, highlighting the associated privacy risks and mitigation stra…☆19Nov 19, 2024Updated last year
- MCP server for Grok AI API integration☆22Jun 2, 2025Updated 9 months ago
- A lightweight OAuth 2.0 Authorization Server supporting Device Authorization Grant (RFC 8628) and Authorization Code Flow with PKCE (RFC …☆32Updated this week
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆27Feb 28, 2026Updated last week
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- ☆32Feb 3, 2026Updated last month
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- ☆13Oct 21, 2024Updated last year
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- Langchain-powered natural language interface to knowledge-graphs.☆17Nov 3, 2025Updated 4 months ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- [AAAI 2026] AutoTool: Efficient Tool Selection for Large Language Model Agents☆29Dec 28, 2025Updated 2 months ago
- Code for the publication "Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation".☆24Dec 4, 2025Updated 3 months ago
- Struct-aware fuzzing framework + some fuzzers☆30Jan 28, 2026Updated last month
- Mobile IDE☆12Nov 9, 2020Updated 5 years ago
- ☆12Nov 5, 2024Updated last year
- UnicEdit-10M and UnicBench project☆23Mar 3, 2026Updated last week
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated last month
- DEPRECATED, since we cannot maintain this Luke repo any longer. Please fork / Luke fork for Lucene 4.3 (mavenized)☆16May 12, 2021Updated 4 years ago
- A sample project for visionOS that showcases FindSurface's functionalities.☆13Dec 18, 2025Updated 2 months ago
- The stl files and code for the V2 DexHand☆51May 26, 2025Updated 9 months ago
- This repository implements the "Ralph" autonomous coding loop pattern, designed to be agnostic of the specific AI agent being used. Wheth…☆31Jan 7, 2026Updated 2 months ago
- ☆41Oct 29, 2025Updated 4 months ago
- ☆25Dec 19, 2025Updated 2 months ago
- The open-source language model computer☆10Mar 22, 2024Updated last year
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆34Dec 16, 2025Updated 2 months ago
- A game engine made in Java using libgdx (Currently in alpha state, and probably will remain that way)☆16Jan 4, 2012Updated 14 years ago
- Swift Implementation of the Model Context Protocol (MCP) Spec☆10Mar 28, 2025Updated 11 months ago
- A model context protocol implementation granting LLMs access to make database queries and learn about supabase types.☆14Dec 13, 2024Updated last year
- AI Intent Driven Development (IDD) guidelines and instructions for AI Coding Agents, AI Coding Assistants, and LLMs.☆29Jan 27, 2026Updated last month
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated last month