apple / ml-flextok
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆132 · Updated last month
Alternatives and similar repositories for ml-flextok
Users who are interested in ml-flextok are comparing it to the libraries listed below.
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆113 · Updated 3 months ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo… ☆167 · Updated last month
- An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL ☆170 · Updated this week
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow) ☆88 · Updated last week
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆190 · Updated 3 weeks ago
- Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆145 · Updated 3 weeks ago
- ☆159 · Updated 4 months ago
- DDT: Decoupled Diffusion Transformer ☆233 · Updated 3 weeks ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆95 · Updated last month
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆102 · Updated 2 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance ☆118 · Updated 2 months ago
- ☆184 · Updated 3 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆203 · Updated last week
- TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge ☆108 · Updated last week
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆151 · Updated last month
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning ☆156 · Updated 3 weeks ago
- Pixel-Space Generative Models ☆197 · Updated last week
- Code for paper "Principal Components" Enable A New Language of Images ☆40 · Updated 3 weeks ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards. ☆140 · Updated 2 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba" ☆136 · Updated 4 months ago
- Official PyTorch implementation of TokenSet. ☆118 · Updated last month
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆68 · Updated 7 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆68 · Updated 6 months ago
- Official PyTorch Implementation of "History-Guided Video Diffusion" ☆306 · Updated 2 months ago
- ☆70 · Updated 5 months ago
- The official implementation of "[MASK] is All You Need" ☆116 · Updated 2 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆118 · Updated 3 months ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral) ☆226 · Updated 5 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024 ☆112 · Updated 6 months ago
- ☆54 · Updated last month