RiseAI-Sys/attention-gym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RiseAI-Sys/attention-gym)

RiseAI-Sys / attention-gym

Triton based sparse quantization attention kernel collection

☆43

Alternatives and similar repositories for attention-gym

Users that are interested in attention-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RiseAI-Sys / DAX
View on GitHub
High performance inference engine for diffusion models
☆107Sep 5, 2025Updated 10 months ago
HarryWu99 / funny_cute
View on GitHub
Some funny cute/cuteDSL code snippets
☆33Mar 2, 2026Updated 4 months ago
ziplab / efficient-stable-diffusion
View on GitHub
☆16Sep 12, 2023Updated 2 years ago
jt-zhang / Sparse_Attention_API
View on GitHub
☆66Oct 25, 2025Updated 9 months ago
serdes21 / flashtile
View on GitHub
FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.
☆61Feb 6, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
WaveSpeedAI / QuantumAttention
View on GitHub
[WIP] Better (FP8) attention for Hopper
☆33Feb 24, 2025Updated last year
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
KuangjuX / NVSHMEM-Tutorial
View on GitHub
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
☆195Feb 11, 2026Updated 5 months ago
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
Phoenix8215 / learn-TensorRT-from-scratch
View on GitHub
learn TensorRT from scratch🥰
☆18Sep 29, 2024Updated last year
NTT123 / cute-viz
View on GitHub
Cute layout visualization
☆44Jan 18, 2026Updated 6 months ago
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Jul 17, 2026Updated last week
verl-project / rl-insight
View on GitHub
Provide performance insight capabilities for RL frameworks.
☆47Updated this week
QwenLM / FlashQLA
View on GitHub
high-performance linear attention kernel library built on TileLang
☆606Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
inclusionAI / Awex
View on GitHub
A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from trainin…
☆166Updated this week
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
gofreelee / SpaceServe
View on GitHub
☆32Jul 13, 2026Updated last week
kyutai-labs / jax-flash-attn3
View on GitHub
JAX bindings for the flash-attention3 kernels
☆23Jan 2, 2026Updated 6 months ago
ISU-RCL / cvBench
View on GitHub
Benchmarking Analysis of Vision Kernels on Embedded CPU, GPU and FPGA
☆16Apr 21, 2019Updated 7 years ago
KuangjuX / cuda-evolve-oss
View on GitHub
Autonomous GPU kernel optimization system driven by AI agents.
☆31Mar 29, 2026Updated 3 months ago
NVIDIA / nsight-python
View on GitHub
Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools
☆283Updated this week
GitBoSun / TutteNet
View on GitHub
☆17Jul 26, 2024Updated last year
LeapLabTHU / UniTTA
View on GitHub
☆21Mar 5, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
JieRen98 / SGEMM-SASS-Annotation
View on GitHub
☆21Mar 22, 2021Updated 5 years ago
Zhaojp-Frank / AwesomePaper-for-AI
View on GitHub
Awesome system papers for AI
☆21Updated this week
2y7c3 / 3DV-TON
View on GitHub
3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models, in MM 2025.
☆17Aug 28, 2025Updated 10 months ago
MingXiangL / Teacache-xDiT
View on GitHub
Combining Teacache with xDiT to Accelerate Visual Generation Models
☆33Apr 21, 2025Updated last year
DiscreteTom / dt-blog-boilerplate
View on GitHub
DiscreteTom's Blog Boilerplate.
☆10Mar 6, 2023Updated 3 years ago
starhiking / ATF
View on GitHub
Face alignment，Facial Landmark detection ，ACM Multimedia Conference 2020
☆12Dec 8, 2022Updated 3 years ago
LinHuang17 / NVF-code
View on GitHub
☆15Jun 21, 2023Updated 3 years ago
vllm-project / vllm-daily
View on GitHub
vLLM Daily Summarization of Merged PRs
☆51Jul 16, 2026Updated last week
qhfan / MALA
View on GitHub
[ICCV2025 highlight]Rectifying Magnitude Neglect in Linear Attention
☆63Jul 24, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
aikitoria / nanotrace
View on GitHub
Low overhead tracing library and trace visualizer for pipelined CUDA kernels
☆137Jul 17, 2026Updated last week
BigAandSmallq / SAD
View on GitHub
Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…
☆31Nov 4, 2025Updated 8 months ago
LeapLabTHU / ENAT
View on GitHub
[NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
☆25Nov 28, 2024Updated last year
thu-ml / SLA
View on GitHub
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
☆324Feb 24, 2026Updated 5 months ago
meta-pytorch / tritonbench
View on GitHub
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
☆362Updated this week
tile-ai / AttentionEngine
View on GitHub
☆52May 19, 2025Updated last year
CareF / CareF-knowledge-lib
View on GitHub
Personal knowledge library
☆10Nov 9, 2017Updated 8 years ago