LargeWorldModel / ElasticTok
ElasticTok: Adaptive Tokenization for Image and Video
☆31Updated last week
Related projects ⓘ
Alternatives and complementary repositories for ElasticTok
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 9 months ago
- ☆108Updated last year
- ☆30Updated 2 weeks ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆58Updated 8 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆78Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆40Updated 4 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆66Updated this week
- The codebase of our paper "Improving the Training of Rectified Flows"☆80Updated 3 weeks ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆26Updated 8 months ago
- ☆64Updated 4 months ago
- ☆44Updated last month
- ☆59Updated 4 months ago
- ☆16Updated 8 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆54Updated last month
- [arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆83Updated 5 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆30Updated 6 months ago
- ☆44Updated 2 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆39Updated 4 months ago
- Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024)☆27Updated 5 months ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆31Updated last week
- Official implementation of "Self-Improving Video Generation"☆48Updated this week
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆88Updated last month
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆52Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆29Updated 4 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆38Updated last week
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆76Updated last month
- ☆10Updated last year
- Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆55Updated 3 weeks ago