lucidrains / genie2-pytorchLinks
Implementation of a framework for Genie2 in Pytorch
☆149Updated 6 months ago
Alternatives and similar repositories for genie2-pytorch
Users that are interested in genie2-pytorch are comparing it to the libraries listed below
Sorting:
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆175Updated last year
- A Video Tokenizer Evaluation Dataset☆127Updated 6 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆110Updated 3 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆102Updated last year
- My take on Flow Matching☆66Updated 6 months ago
- Official PyTorch implementation of TokenSet.☆121Updated 3 months ago
- ☆129Updated 6 months ago
- RS-IMLE☆41Updated 7 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research☆101Updated 8 months ago
- Benchmarking physical understanding in generative video models☆182Updated last month
- Focused on fast experimentation and simplicity☆76Updated 6 months ago
- ☆56Updated 3 months ago
- ☆52Updated 9 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆82Updated 8 months ago
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆169Updated 4 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆82Updated 6 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆224Updated last week
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆213Updated last month
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆110Updated last month
- [Preprint] UCGM: Unified Continuous Generative Models☆161Updated last month
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆278Updated 11 months ago
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆368Updated this week
- Official implementation of Inductive Moment Matching☆499Updated 4 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆128Updated 5 months ago
- ☆120Updated 4 months ago
- Inference-time scaling of diffusion-based image and video generation models.☆156Updated 2 weeks ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 8 months ago
- ☆70Updated 7 months ago
- [ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory☆176Updated last month
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆51Updated 5 months ago