Fast Multi-dimensional Sparse Attention
☆753May 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for NATTEN
Users that are interested in NATTEN are comparing it to the libraries listed below.
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,178May 15, 2024Updated 2 years ago
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.☆995Feb 25, 2026Updated 3 months ago
- EDM2 and Autoguidance -- Official PyTorch implementation☆844Dec 9, 2024Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- [WIP] Better (FP8) attention for Hopper☆33Feb 24, 2025Updated last year
- [ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-t…☆3,388Jan 17, 2026Updated 4 months ago
- New flexible and efficient image generation framework that sets new SOTA on FFHQ-256 with FID 2.05, 2022☆102Jun 26, 2025Updated 11 months ago
- Helpful tools and examples for working with flex-attention☆1,190Apr 13, 2026Updated last month
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,635Mar 16, 2025Updated last year
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆215Sep 27, 2025Updated 8 months ago
- Tile primitives for speedy kernels☆3,377May 22, 2026Updated last week
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,166Dec 22, 2025Updated 5 months ago
- Karras et al. (2022) diffusion models for PyTorch☆2,589Feb 12, 2026Updated 3 months ago
- ☆237Oct 11, 2024Updated last year
- Efficient vision foundation models for high-resolution generation and perception.☆3,310Sep 5, 2025Updated 8 months ago
- VideoSys: An easy and efficient system for video generation☆2,023Aug 27, 2025Updated 9 months ago
- A unified inference and post-training framework for accelerated video generation.☆3,504May 22, 2026Updated last week
- ☆81Dec 27, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- ☆24Jun 18, 2024Updated last year
- ☆192Jan 14, 2025Updated last year
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆634Jul 1, 2024Updated last year
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆10,475May 21, 2026Updated last week
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,299Oct 31, 2024Updated last year
- Ring attention implementation with flash attention☆1,021Sep 10, 2025Updated 8 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,324Jun 8, 2025Updated 11 months ago
- ☆267Jul 11, 2024Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,594May 31, 2024Updated last year
- Fast and memory-efficient exact attention☆23,917Updated this week
- 🚀 Efficient implementations for emerging model architectures☆5,139Updated this week
- ☆44Oct 26, 2024Updated last year
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference☆670May 21, 2026Updated last week
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism☆2,622May 21, 2026Updated last week
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆665Mar 6, 2026Updated 2 months ago
- Official repo for CFG-Zero*☆705May 2, 2025Updated last year
- A suite of image and video neural tokenizers☆1,726Feb 11, 2025Updated last year
- ☆178Jan 8, 2026Updated 4 months ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆462Oct 29, 2025Updated 7 months ago
- A Quirky Assortment of CuTe Kernels☆985May 20, 2026Updated last week