NVlabs / TokenBench
A Video Tokenizer Evaluation Dataset
☆107Updated 2 months ago
Alternatives and similar repositories for TokenBench:
Users that are interested in TokenBench are comparing it to the libraries listed below
- ElasticTok: Adaptive Tokenization for Image and Video☆61Updated 4 months ago
- ☆121Updated 2 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆108Updated last month
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆110Updated last month
- ☆116Updated last month
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆100Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆68Updated 3 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆61Updated 3 weeks ago
- [CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project/☆127Updated this week
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆143Updated last week
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆101Updated this week
- ☆191Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆87Updated last week
- Benchmarking physical understanding in generative video models☆137Updated 3 weeks ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆49Updated 8 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆56Updated last month
- Official PyTorch Implementation of "History-Guided Video Diffusion"☆233Updated 2 weeks ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆82Updated last month
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆40Updated last month
- ☆47Updated 3 months ago
- ☆146Updated 3 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆103Updated 5 months ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆246Updated 2 months ago
- ☆43Updated last month
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 2 years ago
- ☆27Updated last month
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆177Updated last month
- Official implementation of "Self-Improving Video Generation"☆62Updated 3 weeks ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆112Updated last month
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆86Updated 5 months ago