A Video Tokenizer Evaluation Dataset
☆156Jan 13, 2025Updated last year
Alternatives and similar repositories for TokenBench
Users that are interested in TokenBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A suite of image and video neural tokenizers☆1,722Feb 11, 2025Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆1,003Nov 25, 2025Updated 5 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆91Nov 4, 2024Updated last year
- a family of versatile and state-of-the-art video tokenizers.☆445Sep 1, 2025Updated 8 months ago
- ☆52Dec 13, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.☆323Jul 9, 2024Updated last year
- Evaluation codes and data for GenEval2☆69Jan 8, 2026Updated 3 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 3 months ago
- [CVPR 2023] Spatial-then-Temporal Self-Supervised Learning for Video Correspondence☆11Jul 5, 2023Updated 2 years ago
- [NeurlPS-2024] The official code of MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models☆77Jan 9, 2026Updated 3 months ago
- ☆38Feb 6, 2025Updated last year
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"☆149Nov 14, 2024Updated last year
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆442Jan 6, 2026Updated 3 months ago
- This repo contains the code for 1D tokenizer and generator☆1,145Mar 20, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆191Dec 17, 2024Updated last year
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆405Jan 19, 2025Updated last year
- High-performance Image Tokenizers for VAR and AR☆306Apr 25, 2025Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆149Feb 11, 2025Updated last year
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos☆8,097Jan 6, 2026Updated 3 months ago
- ☆144Jun 28, 2024Updated last year
- A unified inference and post-training framework for accelerated video generation.☆3,446Updated this week
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,314Aug 7, 2025Updated 8 months ago
- [ECCV 2024] Official PyTorch implementation of Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation☆12Nov 29, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆459Aug 8, 2025Updated 8 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,904Feb 20, 2026Updated 2 months ago
- Next-Token Prediction is All You Need☆2,399Jan 12, 2026Updated 3 months ago
- Official implementation of BLIP3o-Series☆1,648Nov 29, 2025Updated 5 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,948Aug 15, 2024Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆137Jan 29, 2026Updated 3 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆184Mar 20, 2025Updated last year
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆371Oct 5, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆104Feb 11, 2025Updated last year
- ☆214Feb 11, 2025Updated last year
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆14Apr 1, 2025Updated last year
- ☆131Feb 22, 2025Updated last year
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆133Jul 31, 2025Updated 9 months ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer☆997Jan 17, 2024Updated 2 years ago
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆937Jan 6, 2026Updated 3 months ago