lucidrains / genie2-pytorch
Implementation of a framework for Genie2 in Pytorch
☆149, updated 6 months ago
Alternatives and similar repositories for genie2-pytorch
Users who are interested in genie2-pytorch are comparing it to the libraries listed below:
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation" ☆176, updated last year
- RS-IMLE ☆41, updated 8 months ago
- Implementation of a multimodal diffusion transformer in Pytorch ☆102, updated last year
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆112, updated 4 months ago
- Official PyTorch implementation of TokenSet. ☆121, updated 4 months ago
- A Video Tokenizer Evaluation Dataset ☆129, updated 6 months ago
- My take on Flow Matching ☆69, updated 6 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research ☆102, updated 8 months ago
- ☆52, updated 10 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch ☆279, updated last year
- Implementation of the proposed MaskBit from Bytedance AI ☆82, updated 8 months ago
- ☆133, updated 7 months ago
- [ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching ☆241, updated 8 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆234, updated 2 months ago
- [Preprint] UCGM: Unified Continuous Generative Models ☆165, updated 2 months ago
- ☆70, updated 8 months ago
- ☆58, updated 3 weeks ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆71, updated 10 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆129, updated 6 months ago
- Exploring Diffusion Transformer Designs via Grafting ☆48, updated last month
- Inference-time scaling of diffusion-based image and video generation models. ☆161, updated last month
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research ☆51, updated 6 months ago
- Train VAE like a boss ☆287, updated 9 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch ☆401, updated 6 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba" ☆139, updated 6 months ago
- Official implementation of Inductive Moment Matching ☆529, updated 3 weeks ago
- DDT: Decoupled Diffusion Transformer ☆268, updated last month
- Explorations into improving ViTArc with Slot Attention ☆42, updated 9 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆56, updated 2 months ago
- Focused on fast experimentation and simplicity ☆76, updated 7 months ago