(NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation
☆66Oct 14, 2025Updated 5 months ago
Alternatives and similar repositories for VFMTok
Users that are interested in VFMTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exploring Representation-Aligned Latent Space for Better Generation☆18Updated this week
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated last year
- ☆22Mar 7, 2025Updated last year
- [IJCAI 2025] Offical implementation of the paper "Multi-View Learning with Context-Guided Receptance for Image Denoising".☆12Jun 26, 2025Updated 8 months ago
- ☆22Sep 16, 2025Updated 6 months ago
- Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]☆28Mar 13, 2026Updated last week
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"☆139Oct 17, 2025Updated 5 months ago
- ☆28Sep 19, 2025Updated 6 months ago
- Scale-Equivariant Imaging is a method to deblur and super-resolve a bulk of images by learning from their ensemble statistics☆11Feb 6, 2026Updated last month
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆32Nov 4, 2025Updated 4 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆20Nov 17, 2025Updated 4 months ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆20Jun 15, 2025Updated 9 months ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆14Dec 16, 2024Updated last year
- [TGRS] Continuous urban change detection from satellite image time series☆36Jun 10, 2025Updated 9 months ago
- ☆18Mar 19, 2025Updated last year
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- ☆13Apr 10, 2025Updated 11 months ago
- Adaptive Sparse ViT☆16Aug 1, 2023Updated 2 years ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆21Jan 11, 2026Updated 2 months ago
- [NeurIPS23] PromptRestorer: A Prompting Image Restoration Method with Degradation Perception☆15Aug 4, 2024Updated last year
- ☆14Aug 28, 2024Updated last year
- This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑Language Models via Geom…☆27Mar 15, 2026Updated last week
- SPCL: A New Framework for Domain Adaptive Semantic Segmentation via Semantic Prototype-based Contrastive Learning https://arxiv.org/abs/2…☆11Jun 24, 2022Updated 3 years ago
- The of “Three-Dimension Spatial-Spectral Attention Transformer for Hyperspectral Image Denoising”☆16Sep 30, 2024Updated last year
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- [AAAI 2025] RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo☆15Nov 4, 2025Updated 4 months ago
- DeepEarth: AI Foundation Model for Planetary Science & Sustainability☆27Mar 10, 2026Updated last week
- Gradient as Conditions: Rethinking HOG for All-in-one Image Restoration☆31Dec 23, 2025Updated 3 months ago
- Leonardo Citraro, Mateusz Kozinski, Pascal Fua, Towards Reliable Evaluation of Road Network Reconstructions, ECCV 2020☆11Aug 21, 2020Updated 5 years ago
- Transferring Genshin PVs into a freehand style with Diffusion Model.☆10Jun 5, 2024Updated last year
- The 💩DaBian programming language. 💩"答辩"编程语言, 编程不是💩"答辩"的我不学!☆10Sep 28, 2023Updated 2 years ago
- [ICLR 2026 Oral] Generative Universal Verifier as Multimodal Meta-Reasoner☆54Nov 14, 2025Updated 4 months ago
- PyTorch implementation of ECCV 2024 paper "Confidence-Based Iterative Generation for Real-World Image Super-Resolution"☆16Nov 17, 2024Updated last year
- ☆11Dec 15, 2025Updated 3 months ago
- Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024.☆15Apr 25, 2025Updated 10 months ago
- CVPR24: Neural Visibility Field for Uncertainty-Driven Active Mapping☆21Dec 25, 2024Updated last year