mattiasxu / Video-VQVAE
VQVAE for video prediction
☆26 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for Video-VQVAE
- ElasticTok: Adaptive Tokenization for Image and Video ☆33 · Updated 2 weeks ago
- ☆43 · Updated 2 months ago
- Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion" ☆55 · Updated last month
- ☆63 · Updated last year
- ☆110 · Updated last year
- [ICML 2024] Compositional Image Decomposition with Diffusion Models ☆40 · Updated 4 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆55 · Updated last month
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models" ☆26 · Updated 8 months ago
- ☆44 · Updated 2 months ago
- The codebase of our paper "Improving the Training of Rectified Flows" ☆82 · Updated last month
- ☆33 · Updated 10 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆78 · Updated 2 weeks ago
- ☆45 · Updated 7 months ago
- ☆17 · Updated 9 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆78 · Updated 10 months ago
- The official PyTorch implementation of Fast Diffusion Model ☆91 · Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration". ☆30 · Updated last week
- ☆31 · Updated 5 months ago
- Official implementation of "Self-Improving Video Generation" ☆52 · Updated last week
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆94 · Updated last month
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion" ☆58 · Updated 8 months ago
- ☆48 · Updated last year
- Official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform… ☆29 · Updated last year
- ☆75 · Updated this week
- A big_vision inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. ☆30 · Updated 4 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight) ☆31 · Updated last year
- ☆17 · Updated 3 years ago
- Code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models" ☆38 · Updated 3 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆46 · Updated 2 years ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples. ☆24 · Updated last year