huiwon-jang / CoordTok
☆22Updated 2 weeks ago
Alternatives and similar repositories for CoordTok:
Users that are interested in CoordTok are comparing it to the libraries listed below
- ElasticTok: Adaptive Tokenization for Image and Video☆43Updated 2 months ago
- ☆113Updated last year
- ☆42Updated last week
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆27Updated 10 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆31Updated 6 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆91Updated 2 months ago
- ☆48Updated 3 months ago
- ☆10Updated last year
- Stable Consistency Tuning: Understanding and Improving Consistency models☆15Updated last month
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆81Updated 11 months ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆77Updated this week
- ☆66Updated last month
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆92Updated 2 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆39Updated 6 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆22Updated last month
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆80Updated 2 months ago
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆31Updated 2 months ago
- A Video Tokenizer Evaluation Dataset☆79Updated last week
- ☆121Updated 3 weeks ago
- Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024)☆30Updated 7 months ago
- Official implementation of "Self-Improving Video Generation"☆57Updated 2 weeks ago
- ☆20Updated 6 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆45Updated last month
- ☆21Updated last month
- ☆43Updated 4 months ago
- ☆44Updated 9 months ago
- ☆12Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆55Updated 2 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆45Updated 6 months ago