wilson1yan / VideoGPT-Paper
☆17Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for VideoGPT-Paper
- VQVAE for video prediction☆26Updated 2 years ago
- ElasticTok: Adaptive Tokenization for Image and Video☆33Updated 2 weeks ago
- ☆75Updated this week
- ☆110Updated last year
- Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆55Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆55Updated last month
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 2 years ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 10 months ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆34Updated last month
- ☆43Updated 2 months ago
- [ECCV 2024] Code for "EraseDraw: Learning to Insert Objects by Erasing Them from Images"☆17Updated 3 months ago
- ☆13Updated 4 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆75Updated 7 months ago
- Procedural Image Programs for Representation Learning - NeurIPS 2022☆31Updated last month
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆78Updated 2 weeks ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆40Updated 4 months ago
- Official implementation of "Self-Improving Video Generation"☆52Updated last week
- ☆21Updated 4 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆26Updated 8 months ago
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆44Updated 8 months ago
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024).☆16Updated 2 weeks ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆58Updated 8 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆30Updated last week
- ☆47Updated 2 months ago
- [NeurIPS 2023] InsActor: Instruction-driven Physics-based Characters☆133Updated 7 months ago
- ☆63Updated last year
- ☆48Updated last year
- ☆44Updated 2 months ago