This repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (VTs) in the context of autoregressive (AR) image generation.
☆35Jul 30, 2025Updated 7 months ago
Alternatives and similar repositories for VTBench
Users that are interested in VTBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆153Jul 24, 2025Updated 8 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- official training and inference code of bitwise tokenizer☆71May 18, 2025Updated 10 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆42Feb 12, 2025Updated last year
- [ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion☆24Feb 5, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- Few shot recognition using CLIP's OpenAI architecture.☆36Aug 2, 2021Updated 4 years ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 11 months ago
- ☆28Feb 15, 2026Updated last month
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆21Jan 11, 2026Updated 2 months ago
- [ECCV'24] Self-training Room Layout Estimation via Geometry-aware Ray-casting☆15Jan 20, 2025Updated last year
- ☆12Apr 25, 2025Updated 11 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆216Mar 11, 2026Updated 2 weeks ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code implementation for: From Virtual Games to Real-World Play☆46Jun 23, 2025Updated 9 months ago
- This is an official code for UniConvNet on ICCV 2025☆37Nov 21, 2025Updated 4 months ago
- ☆16Jul 1, 2024Updated last year
- ☆14Oct 10, 2021Updated 4 years ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 8 months ago
- ☆19Jun 23, 2021Updated 4 years ago
- ☆48Apr 3, 2025Updated 11 months ago
- ☆27Jul 18, 2025Updated 8 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆19Jun 26, 2025Updated 9 months ago
- ☆131Nov 8, 2025Updated 4 months ago
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning☆36Mar 13, 2026Updated last week
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year
- This is an open-source repository based on our paper, primarily applied in the field of remote sensing image compression.☆19May 15, 2024Updated last year
- JoPano: Unified Panorama Generation via Joint Modeling☆24Mar 6, 2026Updated 2 weeks ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated 2 months ago
- Multi-person trajectory dataset in diverse indoor scenes☆14Jan 12, 2026Updated 2 months ago
- ☆13Jan 3, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Jun 28, 2023Updated 2 years ago
- 给科研小白的一些资源与工具推荐☆17Jul 6, 2020Updated 5 years ago
- 基于Detectron2的仰卧起坐识别☆12Dec 1, 2020Updated 5 years ago
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation☆66Oct 14, 2025Updated 5 months ago
- ☆21Sep 8, 2025Updated 6 months ago
- Code for ICML 2021 paper "Regularizing towards Causal Invariance: Linear Models with Proxies" (ICML 2021)☆11Mar 14, 2022Updated 4 years ago
- ☆18Jul 10, 2024Updated last year