lucidrains / LVMAE-pytorch
Implementation of the proposed LVMAE, from the paper "Extending Video Masked Autoencoders to 128 frames", in PyTorch
☆51Updated 6 months ago
Alternatives and similar repositories for LVMAE-pytorch
Users who are interested in LVMAE-pytorch are comparing it to the libraries listed below
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆35Updated 11 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆30Updated last week
- The official repo of continuous speculative decoding☆26Updated 2 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆34Updated last month
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆118Updated 4 months ago
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆97Updated last week
- Stable Consistency Tuning: Understanding and Improving Consistency models☆16Updated 6 months ago
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers"☆66Updated 2 months ago
- Native-resolution diffusion Transformer☆43Updated this week
- Implementation of the proposed MaskBit from Bytedance AI☆80Updated 6 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆37Updated 8 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆69Updated 7 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆115Updated 7 months ago
- 👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"☆27Updated 7 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆62Updated 3 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆63Updated 2 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆74Updated 3 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 6 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆104Updated 3 weeks ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆65Updated last week
- Towards training VQ-VAE models robustly!☆74Updated 4 months ago
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆33Updated 2 months ago
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…☆29Updated 2 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆30Updated 7 months ago
- Codebase for the paper "Elucidating the design space of language models for image generation"☆45Updated 6 months ago