wilson1yan / VideoGPT-Paper
☆18 Updated 3 years ago
Alternatives and similar repositories for VideoGPT-Paper:
Users who are interested in VideoGPT-Paper are comparing it to the repositories listed below.
- VQVAE for video prediction ☆27 Updated 2 years ago
- ☆116 Updated last year
- ☆23 Updated 2 weeks ago
- ElasticTok: Adaptive Tokenization for Image and Video ☆54 Updated 3 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆82 Updated last year
- ☆51 Updated 4 months ago
- ☆43 Updated 5 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models ☆47 Updated 7 months ago
- A Video Tokenizer Evaluation Dataset ☆101 Updated last month
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion" ☆63 Updated 11 months ago
- Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024) ☆34 Updated 9 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆103 Updated last week
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models" ☆28 Updated 11 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆48 Updated 2 months ago
- Reading list for research topics in intuitive physics for artificial cognition. ☆18 Updated 2 years ago
- ☆27 Updated last month
- [NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC ☆13 Updated last year
- ☆14 Updated last year
- Codebase of Truncated Consistency Models (ICLR 2025) ☆18 Updated 3 weeks ago
- Official code for Slot-Transformer for Videos (STEVE) ☆49 Updated 2 years ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆46 Updated 2 years ago
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024 ☆50 Updated 11 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). ☆50 Updated last week
- ☆21 Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24). ☆35 Updated 9 months ago
- ☆10 Updated last year
- ☆66 Updated 3 weeks ago