wilson1yan / teco
☆110Updated last year
Related projects ⓘ
Alternatives and complementary repositories for teco
- ElasticTok: Adaptive Tokenization for Image and Video☆32Updated 2 weeks ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆70Updated 2 weeks ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 10 months ago
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆71Updated 3 months ago
- Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆55Updated last month
- ☆113Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆55Updated last month
- ☆17Updated 9 months ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆84Updated 2 years ago
- VQVAE for video prediction☆26Updated 2 years ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆78Updated last year
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆100Updated last year
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆93Updated last month
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆58Updated 8 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆31Updated 6 months ago
- ☆43Updated 2 months ago
- ☆72Updated 2 years ago
- [arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆84Updated 5 months ago
- ☆68Updated 2 months ago
- Codebase for HiP☆87Updated 11 months ago
- Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).☆304Updated 6 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆42Updated 4 months ago
- ☆195Updated last month
- ☆64Updated 4 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆55Updated last month
- ☆44Updated last month
- Official Code for Neural Systematic Binder☆29Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated last month
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆76Updated last week