mattiasxu / Video-VQVAE
VQVAE for video prediction
☆26 Updated 2 years ago
Alternatives and similar repositories for Video-VQVAE:
Users who are interested in Video-VQVAE are comparing it to the libraries listed below:
- ElasticTok: Adaptive Tokenization for Image and Video ☆49 Updated 2 months ago
- ☆43 Updated 4 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models" ☆27 Updated 10 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models ☆46 Updated 6 months ago
- ☆66 Updated this week
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion" ☆63 Updated 10 months ago
- Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion" ☆56 Updated 3 months ago
- ☆48 Updated 4 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆45 Updated 2 years ago
- ☆53 Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆96 Updated last week
- ☆35 Updated last year
- ☆114 Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆79 Updated last year
- ☆22 Updated last month
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform… ☆29 Updated last year
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight) ☆32 Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch ☆64 Updated 2 years ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025). ☆47 Updated last week
- A big_vision inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. ☆34 Updated 7 months ago
- ☆10 Updated last year
- Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024) ☆33 Updated 8 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024 ☆92 Updated 3 months ago
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles" ☆26 Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆59 Updated 4 months ago
- ☆65 Updated 6 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆81 Updated last year
- (wip) Use LAION-AI's CLIP "conditioned prior" to generate CLIP image embeds from CLIP text embeds. ☆27 Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX) ☆35 Updated 2 years ago
- ☆73 Updated 2 years ago