wilson1yan / VideoGPT-Paper
☆18Updated 4 years ago
Alternatives and similar repositories for VideoGPT-Paper
Users that are interested in VideoGPT-Paper are comparing it to the libraries listed below
Sorting:
- VQVAE for video prediction☆27Updated 3 years ago
- ElasticTok: Adaptive Tokenization for Image and Video☆67Updated 6 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆86Updated last year
- ☆118Updated 2 months ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆65Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆119Updated 3 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- ☆73Updated 2 years ago
- ☆45Updated 2 months ago
- Codebase of Truncated Consistency Models (ICLR 2025)☆23Updated 3 months ago
- ☆30Updated 3 months ago
- ☆10Updated last year
- A Video Tokenizer Evaluation Dataset☆115Updated 4 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- ☆75Updated 10 months ago
- 👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"☆26Updated 6 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆114Updated 3 months ago
- [NeurIPS 2021] Code for Learning Signal-Agnostic Manifolds of Neural Fields☆68Updated 2 years ago
- ☆48Updated last year
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆84Updated 2 years ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆77Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆84Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆51Updated last year
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆112Updated 7 months ago
- ☆68Updated 3 months ago
- Collaborative Score Distillation for Consistent Visual Synthesis (NeurIPS 2023)☆117Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆68Updated 3 months ago
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition☆37Updated last year
- ☆31Updated 4 months ago