Official repository for the UAE paper, unified-GRPO, and unified-Bench
☆158Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for UAE
Users that are interested in UAE are comparing it to the libraries listed below
Sorting:
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 3 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆851Dec 23, 2025Updated 2 months ago
- [AAAI26] Next Patch Prediction☆132Jan 2, 2025Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆178Feb 24, 2026Updated last week
- WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆61Sep 3, 2025Updated 6 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆187Nov 6, 2025Updated 4 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Jan 18, 2025Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆309Oct 12, 2025Updated 4 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆78Feb 13, 2026Updated 3 weeks ago
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆152Jan 27, 2026Updated last month
- ☆190Dec 17, 2024Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Feb 27, 2025Updated last year
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated last year
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"