lukaslaobeyer / token-optView external linksLinks
Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"
☆201Jun 10, 2025Updated 8 months ago
Alternatives and similar repositories for token-opt
Users that are interested in token-opt are comparing it to the libraries listed below
Sorting:
- This repo contains the code for 1D tokenizer and generator☆1,109Mar 20, 2025Updated 10 months ago
- High-performance Image Tokenizers for VAR and AR☆302Apr 25, 2025Updated 9 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆243Oct 12, 2025Updated 4 months ago
- WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆60Sep 3, 2025Updated 5 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆151Jul 24, 2025Updated 6 months ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting☆70Jan 9, 2026Updated last month
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆121Mar 4, 2025Updated 11 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 5 months ago
- ☆304May 29, 2025Updated 8 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆144Feb 11, 2025Updated last year
- HIVE: Evaluating the Human Interpretability of Visual Explanations (ECCV 2022)☆21Jan 19, 2023Updated 3 years ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 8 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Apr 10, 2025Updated 10 months ago
- DDT: Decoupled Diffusion Transformer☆361Aug 22, 2025Updated 5 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆992Nov 25, 2025Updated 2 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Feb 11, 2025Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆290Jun 2, 2025Updated 8 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆206Jul 14, 2025Updated 6 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆172Dec 17, 2025Updated last month
- ☆18Mar 2, 2025Updated 11 months ago
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed☆26Jul 26, 2025Updated 6 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆164Jan 31, 2025Updated last year
- [ICLR 2026] Autoregressive Image Generation with Randomized Parallel Decoding☆86Jan 27, 2026Updated 2 weeks ago
- Lowering PyTorch's Memory Consumption for Selective Differentiation☆12Aug 29, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆433Aug 8, 2025Updated 6 months ago
- Code for Text + Sketch: Image Compression at Ultra Low Rates☆49Aug 25, 2025Updated 5 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated 11 months ago
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer☆280Oct 28, 2025Updated 3 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆201Apr 29, 2025Updated 9 months ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆19Jun 10, 2025Updated 8 months ago
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆48Oct 15, 2025Updated 3 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,859Sep 27, 2024Updated last year
- HallE-Control: Controlling Object Hallucination in LMMs☆31Apr 10, 2024Updated last year
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25)☆92Jul 4, 2025Updated 7 months ago
- Towards Scalable Pre-training of Visual Tokenizers for Generation☆440Dec 16, 2025Updated last month
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated last year
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆56Sep 16, 2025Updated 4 months ago
- Masked Autoencoder meets GANs☆31Dec 27, 2023Updated 2 years ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,544Mar 16, 2025Updated 10 months ago