Helpful tools and examples for working with flex-attention
★1,197 · May 28, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for attention-gym
Users that are interested in attention-gym are comparing it to the libraries listed below.
- A PyTorch native platform for training generative AI models · ★5,435 · Updated this week
- Efficient implementations for emerging model architectures · ★5,227 · Jun 11, 2026 · Updated last week
- Tile primitives for speedy kernels · ★3,436 · Updated this week
- Ring attention implementation with flash attention · ★1,026 · Sep 10, 2025 · Updated 9 months ago
- FlashInfer: Kernel Library for LLM Serving · ★5,791 · Updated this week
- Efficient Triton Kernels for LLM Training · ★6,444 · Updated this week
- Distributed Compiler based on Triton for Parallel Systems · ★1,459 · Apr 22, 2026 · Updated last month
- A sparse attention kernel supporting mix sparse patterns · ★525 · Jan 18, 2026 · Updated 5 months ago
- PyTorch native quantization and sparsity for training and inference