This repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (VTs) in the context of autoregressive (AR) image generation.
☆35Jul 30, 2025Updated 7 months ago
Alternatives and similar repositories for VTBench
Users that are interested in VTBench are comparing it to the libraries listed below
Sorting:
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆152Jul 24, 2025Updated 7 months ago
- official training and inference code of bitwise tokenizer☆69May 18, 2025Updated 9 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning☆35Feb 9, 2026Updated 3 weeks ago
- Code implementation for: From Virtual Games to Real-World Play☆46Jun 23, 2025Updated 8 months ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Dec 23, 2022Updated 3 years ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆186Nov 6, 2025Updated 3 months ago
- [ECCV'24] Self-training Room Layout Estimation via Geometry-aware Ray-casting☆15Jan 20, 2025Updated last year
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- A Genetic Algorithms framework for Hadoop MapReduce.☆10May 30, 2018Updated 7 years ago
- ☆48Apr 3, 2025Updated 11 months ago
- Large Language Models Powered Context-aware Motion Prediction☆14Jan 12, 2026Updated last month
- ☆12Apr 25, 2025Updated 10 months ago
- Meetup Call for Paper pour soumettre vos idées de sujets☆11Nov 1, 2017Updated 8 years ago
- A custom open ai gym environment for solo experimentation.☆12Apr 14, 2021Updated 4 years ago
- An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"☆14Feb 23, 2026Updated last week
- Unofficial implementation of 'Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator'☆10Dec 10, 2024Updated last year
- ☆12Dec 9, 2025Updated 2 months ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Jan 26, 2024Updated 2 years ago
- UVA-Human-Skeleton-Preprocessing☆10May 4, 2023Updated 2 years ago
- ICME'19: Removing Rain in Videos: A Large-scale Database and A Two-stream ConvLSTM Approach☆12Jul 4, 2022Updated 3 years ago
- Web application interface for Mathematica☆11Mar 28, 2015Updated 10 years ago
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- Submission Under Review☆17May 15, 2025Updated 9 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- ☆118Nov 8, 2025Updated 3 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆211Jan 27, 2026Updated last month
- 给科研小白的一些资源与工具推荐☆17Jul 6, 2020Updated 5 years ago
- ☆14Oct 10, 2021Updated 4 years ago
- ☆11May 14, 2025Updated 9 months ago
- A standards-first framework with a unified approach to building fullstack web apps.☆11Jul 1, 2025Updated 8 months ago
- Training Vision Transformers for Semi-Supervised Semantic Segmentation☆14Nov 3, 2025Updated 4 months ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 4 months ago
- This is code for How Do Social Bots Participate in Misinformation Spread? A Comprehensive Dataset and Analysis☆14Nov 5, 2025Updated 3 months ago
- Mathematics, Algorithmic, Data-Science, Teaching Materials☆13Jan 18, 2026Updated last month
- Multi-person trajectory dataset in diverse indoor scenes☆13Jan 12, 2026Updated last month
- ☆28Feb 15, 2026Updated 2 weeks ago
- A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving a…☆190Updated this week