SHI-Labs / NATTENLinks

Fast Multi-dimensional Sparse Attention

☆654

Alternatives and similar repositories for NATTEN

Users that are interested in NATTEN are comparing it to the libraries listed below

Sorting:

naver-ai / rope-vit
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
☆409Updated 2 weeks ago
NVlabs / DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
☆503Updated last year
lucidrains / mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
☆471Updated 10 months ago
sihyun-yu / REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,412Updated 8 months ago
lucidrains / ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model
☆621Updated 11 months ago
kyleliang919 / C-Optim
When it comes to optimizers, it's always better to be safe than sorry
☆378Updated last month
sail-sg / MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
☆582Updated last year
NVlabs / edm2
EDM2 and Autoguidance -- Official PyTorch implementation
☆790Updated 11 months ago
bytedance / 1d-tokenizer
This repo contains the code for 1D tokenizer and generator
☆1,073Updated 7 months ago
hustvl / LightningDiT
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,276Updated 5 months ago
willisma / SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,021Updated last week
Anima-Lab / MaskDiT
Code for Fast Training of Diffusion Models with Masked Transformers
☆416Updated last year
sony / ctm
☆310Updated last year
hutaiHang / Faster-Diffusion
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
☆345Updated 8 months ago
facebookresearch / ToMe
A method to increase the speed and lower the memory footprint of existing vision transformers.
☆1,117Updated last year
lucidrains / rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
☆779Updated 3 months ago
horseee / DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
☆940Updated last year
CompVis / zigma
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
☆339Updated 7 months ago
zugexiaodui / torch_flops
A library for calculating the FLOPs in the forward() process based on torch.fx
☆130Updated 7 months ago
zhuyu-cs / MeanFlow
Pytorch implementation of MeanFlow on ImageNet and CIFAR10
☆338Updated 2 months ago
SHI-Labs / Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
☆1,155Updated last year
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,051Updated 8 months ago
feizc / DiT-MoE
Scaling Diffusion Transformers with Mixture of Experts
☆400Updated last year
cloneofsimo / vqgan-training
Train VAE like a boss
☆298Updated last year
baofff / U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
☆1,059Updated 2 years ago
MCG-NJU / DDT
DDT: Decoupled Diffusion Transformer
☆317Updated 2 months ago
whlzy / FiT
[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model
☆426Updated last year
valeoai / Halton-MaskGIT
[ICLR2025] Halton Scheduler for Masked Generative Image Transformer
☆275Updated 2 weeks ago
zai-org / Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
☆441Updated last year
NVlabs / FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
☆886Updated 3 months ago