Image Tokenizer Needs Post-Training
☆24Oct 4, 2025Updated 5 months ago
Alternatives and similar repositories for RobusTok
Users that are interested in RobusTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆77Jul 30, 2025Updated 7 months ago
- ☆13Sep 2, 2023Updated 2 years ago
- Test-time Scaling for VAR models☆31Sep 19, 2025Updated 6 months ago
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- ☆15May 4, 2025Updated 10 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆246Oct 12, 2025Updated 5 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆180Feb 24, 2026Updated 3 weeks ago
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆175Updated this week
- ☆20Dec 8, 2024Updated last year
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆77Sep 19, 2025Updated 6 months ago
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space☆24Mar 15, 2026Updated last week
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year
- ☆22Sep 26, 2024Updated last year
- ☆20Nov 14, 2022Updated 3 years ago
- Synthetic Alphabet Dataset☆19Mar 27, 2025Updated 11 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆143Jan 13, 2025Updated last year
- ☆30Mar 30, 2025Updated 11 months ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated last month
- This is the official implementation for ControlVAR.☆126Dec 10, 2024Updated last year
- https://github.com/xie-lab-ml/Golden-Noise-for-Diffusion-Models for ComfyUI☆18Dec 10, 2024Updated last year
- ☆29Jun 9, 2025Updated 9 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆171May 1, 2025Updated 10 months ago
- ☆18Dec 8, 2024Updated last year
- ☆33Dec 20, 2023Updated 2 years ago
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Jul 27, 2025Updated 7 months ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- High-performance Image Tokenizers for VAR and AR☆303Apr 25, 2025Updated 10 months ago
- Official code for the paper: Can3Tok (ICCV2025)☆39Aug 23, 2025Updated 7 months ago
- Subjects200K dataset☆130Jan 17, 2025Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- ☆10Sep 26, 2019Updated 6 years ago
- Stability-AI's SV3D (ECCV 2024 oral, Voleti et al.) in the diffusers convention.☆32Feb 5, 2025Updated last year