lucidrains / LVMAE-pytorchLinks

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

☆52

Alternatives and similar repositories for LVMAE-pytorch

Users that are interested in LVMAE-pytorch are comparing it to the libraries listed below

Sorting:

philippe-eecs / small-vision
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
☆34Updated last year
OliverRensu / MVAR
☆70Updated 8 months ago
lucidrains / titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
☆176Updated last year
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆129Updated 6 months ago
zelaki / ReDi
Boosting Generative Image Modeling via Joint Image-Feature Synthesis
☆50Updated last month
FutureXiang / edm2
Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"
☆34Updated last year
NVlabs / GSPN
[CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network
☆104Updated 2 weeks ago
Gsunshine / py-meanflow
Pytorch implementation for MeanFlow
☆62Updated this week
NVlabs / DDO
[ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN-type Discrimination
☆90Updated last month
sangyun884 / rfpp
The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024
☆119Updated 9 months ago
LINs-lab / UCGM
[Preprint] UCGM: Unified Continuous Generative Models
☆165Updated 2 months ago
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆27Updated 4 months ago
hp-l33 / AiM
Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"
☆139Updated 6 months ago
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆102Updated last year
zh460045050 / VQGAN-LC
☆131Updated last year
shim0114 / SSM-Meets-Video-Diffusion-Models
☆48Updated 4 months ago
shiml20 / FlowTurbo
[NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"
☆71Updated 10 months ago
Neur-IO / OptVQ
Towards training VQ-VAE models robustly!
☆79Updated 3 weeks ago
DAMO-NLP-SG / DiGIT
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆69Updated 9 months ago
lucidrains / maskbit-pytorch
Implementation of the proposed MaskBit from Bytedance AI
☆82Updated 8 months ago
maple-research-lab / SIM
Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]
☆81Updated 8 months ago
Gen-Verse / HermesFlow
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
☆63Updated 5 months ago
TIGER-AI-Lab / Vamba
Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]
☆78Updated last week
feizc / DiS
Scalable Diffusion Models with State Space Backbone
☆156Updated last year
CompVis / discrete-interpolants
The official implementation of "[MASK] is All You Need"
☆122Updated last week
jacklishufan / OmniFlows
The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
☆79Updated last month
feizc / Dimba
Transformer-Mamba Diffusion Models
☆111Updated last year
ali-vilab / alitok
AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
☆40Updated last month
feizc / Diffusion-RWKV
Scaling RWKV-Like Architectures for Diffusion Models
☆136Updated last year
mlvlab / CAF
Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024
☆33Updated 6 months ago