prasannakotyal / flash-attention-cudaView external linksLinks
Flash attention implementation Minimal CUDA implementation of Flash Attention with tiled computation and online softmax. Educational implementation based on Dao et al., 2022.
☆20Dec 27, 2025Updated last month
Alternatives and similar repositories for flash-attention-cuda
Users that are interested in flash-attention-cuda are comparing it to the libraries listed below
Sorting:
- Material for the Design and Analysis of Algorithms course taught at Princess Sumaya University for Technology☆51May 25, 2025Updated 8 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆24Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆22Nov 13, 2025Updated 3 months ago
- ☆11Nov 10, 2025Updated 3 months ago
- ☆12Sep 21, 2023Updated 2 years ago
- Python script demonstrating the process of recovering text from embeddings, highlighting the associated privacy risks and mitigation stra…☆18Nov 19, 2024Updated last year
- ☆28Feb 3, 2026Updated 2 weeks ago
- ☆13Oct 21, 2024Updated last year
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated last month
- MCP server for Grok AI API integration☆19Jun 2, 2025Updated 8 months ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆14Nov 25, 2024Updated last year
- It shows an intelligent agent based on LangGraph for long form writing.☆12Mar 1, 2025Updated 11 months ago
- ☆12Apr 19, 2024Updated last year
- A sample project for visionOS that showcases FindSurface's functionalities.☆13Dec 18, 2025Updated last month
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 3 months ago
- Agent-OM: Leveraging LLM Agents for Ontology Matching☆17Jan 24, 2026Updated 3 weeks ago
- Mobile IDE☆12Nov 9, 2020Updated 5 years ago
- A search index specialised for LaTeX equations. Developed for latexsearch.com.☆17Jul 15, 2011Updated 14 years ago
- [AAAI 2026] AutoTool: Efficient Tool Selection for Large Language Model Agents☆28Dec 28, 2025Updated last month
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated 11 months ago
- Run GEPA on your favorite non-python libraries.☆32Jan 22, 2026Updated 3 weeks ago
- Langchain-powered natural language interface to knowledge-graphs.☆17Nov 3, 2025Updated 3 months ago
- ☆24Oct 3, 2025Updated 4 months ago
- ☆12Nov 5, 2024Updated last year
- ☆24Updated this week
- The stl files and code for the V2 DexHand☆46May 26, 2025Updated 8 months ago
- Metadata browser of TREC☆10Jan 5, 2026Updated last month
- ☆25Dec 14, 2025Updated 2 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 weeks ago
- ☆11May 6, 2025Updated 9 months ago
- Swift Implementation of the Model Context Protocol (MCP) Spec☆10Mar 28, 2025Updated 10 months ago
- Demo app that shows how you can use D3.js with iOS in a UIWebView.☆10May 24, 2013Updated 12 years ago
- Using this LLM-powered tool you can seamlessly create high quality (tiktok type) videos☆11Sep 10, 2024Updated last year
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 2 months ago
- UnicEdit-10M and UnicBench project☆23Feb 8, 2026Updated last week
- Code for the publication "Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation".☆24Dec 4, 2025Updated 2 months ago