lucidrains / LVMAE-pytorch
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
☆49Updated 5 months ago
Alternatives and similar repositories for LVMAE-pytorch:
Users that are interested in LVMAE-pytorch are comparing it to the libraries listed below
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 10 months ago
- ☆45Updated last month
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆113Updated 3 months ago
- ☆30Updated 3 months ago
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers"☆65Updated last month
- ☆159Updated 4 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆76Updated 5 months ago
- ☆70Updated 5 months ago
- Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆29Updated 2 weeks ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 2 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆68Updated 7 months ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆21Updated 2 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆68Updated 6 months ago
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…☆29Updated last year
- The official repo of continuous speculative decoding☆26Updated last month
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆37Updated 7 months ago
- VQVAE for video prediction☆27Updated 3 years ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 5 months ago
- ☆61Updated last year
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆88Updated last week
- The official implementation of "[MASK] is All You Need"☆116Updated 2 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆172Updated 10 months ago
- Towards training VQ-VAE models robustly!☆72Updated 4 months ago
- Code for paper "Principal Components" Enable A New Language of Images☆40Updated 3 weeks ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆136Updated 3 months ago
- Official repository of paper "Subobject-level Image Tokenization"☆70Updated last month
- An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆170Updated this week
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆58Updated last week
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆33Updated last month