A high-performance attention mechanism that computes softmax normalization in a single streaming pass using running accumulators (online softmax).
☆31Jun 3, 2026Updated last month
Alternatives and similar repositories for StreamAttn
Users that are interested in StreamAttn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning about CUDA by writing PTX code.☆159Feb 27, 2024Updated 2 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- Because it's there.☆16Sep 22, 2024Updated last year
- ☆24May 26, 2026Updated last month
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic☆12Dec 18, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Compare Bloxroute and Fiber transaction streams☆10Nov 22, 2024Updated last year
- Tendermint implementation of the blockchain of Aleo verifiable computing model built by LambdaClass☆15Feb 8, 2023Updated 3 years ago
- CDLS: Proving Knowledge of Committed Discrete Logarithms with Soundness☆12Apr 30, 2026Updated 2 months ago
- The entry point for Rust projects to be run on Valida☆10Mar 14, 2025Updated last year
- ☆32Jun 22, 2025Updated last year
- ☆12Jun 5, 2025Updated last year
- 🦎 Prototypes on polymorphic, metamorphic and poly-metamorphic malwares in Rust 🦎☆14Oct 8, 2023Updated 2 years ago
- ☆28Apr 19, 2026Updated 2 months ago
- Composable numerical solvers for unconstrained and simple-bounds constrained convex optimization problems in Rust. WASM compatible☆16Jul 10, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Oct 4, 2023Updated 2 years ago
- ☆12Sep 11, 2024Updated last year
- A system for scheduling serverless edge functions☆11Aug 11, 2020Updated 5 years ago
- ☆16Jan 24, 2025Updated last year
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆31Updated this week
- ☆12Feb 18, 2025Updated last year
- ☆20May 30, 2026Updated last month
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- ☆20Aug 23, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Sep 25, 2023Updated 2 years ago
- Highly experimental fault-proof program for Optimism Bedrock☆20Mar 27, 2023Updated 3 years ago
- ☆13Jun 24, 2025Updated last year
- A repo to train your own small sarvam-30b model☆78Mar 7, 2026Updated 3 months ago
- Fast approximation of similarity for sets of very different sizes☆20Mar 8, 2022Updated 4 years ago
- ☆22May 5, 2025Updated last year
- Verified implementations for the Noise family of protocols☆17Jun 18, 2024Updated 2 years ago
- MIRIS: Fast Object Track Queries in Video☆17Mar 24, 2023Updated 3 years ago
- Personal solutions to the Triton Puzzles☆21Jul 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A tag editor written in C# and WPF☆12Aug 20, 2023Updated 2 years ago
- ☆19Mar 3, 2025Updated last year
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Jan 25, 2025Updated last year
- TiDB AI SDK: Unified Multi-Modal Data Platform for AI Apps & Agents☆31Mar 6, 2026Updated 3 months ago
- Causal inference library for timeseries analysis☆43Jun 22, 2026Updated last week
- Demo for learning frontrun bot arbitrage☆19Jun 18, 2023Updated 3 years ago
- Local LLM as a search relevance judge☆30Mar 2, 2025Updated last year