Image Tokenizer Needs Post-Training
☆24 · Oct 4, 2025 · Updated 6 months ago
Alternatives and similar repositories for RobusTok
Users that are interested in RobusTok are comparing it to the libraries listed below.
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆12 · Dec 29, 2024 · Updated last year
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression ☆78 · Jul 30, 2025 · Updated 8 months ago
- ☆13 · Sep 2, 2023 · Updated 2 years ago
- Test-time Scaling for VAR models ☆31 · Sep 19, 2025 · Updated 6 months ago
- Explaining audio differences using language ☆16 · Feb 11, 2025 · Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition" ☆12 · Feb 27, 2024 · Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 · Jun 28, 2025 · Updated 9 months ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems ☆13 · Jan 16, 2025 · Updated last year
- ☆15 · May 4, 2025 · Updated 11 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆248 · Oct 12, 2025 · Updated 6 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ☆181 · Feb 24, 2026 · Updated last month
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation" ☆14 · Dec 31, 2023 · Updated 2 years ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation ☆22 · Dec 17, 2024 · Updated last year
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆176 · Mar 18, 2026 · Updated 3 weeks ago
- ☆20 · Dec 8, 2024 · Updated last year
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ☆77 · Sep 19, 2025 · Updated 6 months ago
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space ☆26 · Mar 15, 2026 · Updated 3 weeks ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆75 · Aug 24, 2024 · Updated last year
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text ☆53 · Mar 16, 2025 · Updated last year
- ☆22 · Sep 26, 2024 · Updated last year
- ☆20 · Nov 14, 2022 · Updated 3 years ago
- Synthetic Alphabet Dataset ☆19 · Mar 27, 2025 · Updated last year
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba" ☆145 · Jan 13, 2025 · Updated last year
- ☆30 · Mar 30, 2025 · Updated last year
- Generative Modeling via Drifting in MLX ☆42 · Feb 6, 2026 · Updated 2 months ago
- This is the official implementation for ControlVAR. ☆127 · Dec 10, 2024 · Updated last year
- https://github.com/xie-lab-ml/Golden-Noise-for-Diffusion-Models for ComfyUI ☆18 · Dec 10, 2024 · Updated last year
- ☆29 · Jun 9, 2025 · Updated 10 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆171 · May 1, 2025 · Updated 11 months ago
- ☆18 · Dec 8, 2024 · Updated last year
- ☆35 · Dec 20, 2023 · Updated 2 years ago
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”. ☆21 · Jul 27, 2025 · Updated 8 months ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023]. ☆20 · Sep 19, 2024 · Updated last year
- High-performance Image Tokenizers for VAR and AR ☆305 · Apr 25, 2025 · Updated 11 months ago
- Official code for the paper: Can3Tok (ICCV2025) ☆39 · Aug 23, 2025 · Updated 7 months ago
- Subjects200K dataset ☆129 · Jan 17, 2025 · Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt… ☆30 · Oct 21, 2024 · Updated last year
- ☆10 · Sep 26, 2019 · Updated 6 years ago
- [3DV 2024] Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization ☆34 · Mar 17, 2025 · Updated last year