PKU-YuanGroup/UniSandBox

PKU-YuanGroup / UniSandBox

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

☆60

Alternatives and similar repositories for UniSandBox

Users that are interested in UniSandBox are comparing it to the libraries listed below.

PKU-YuanGroup / WISE
[ICML 2026🔥] WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆212 · Jun 26, 2026 · Updated last month
WayneJin0918 / SRUM
[ECCV 2026🔥] SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models
☆93 · Nov 26, 2025 · Updated 8 months ago
Tencent-Hunyuan / GEAR
☆65 · Jul 1, 2026 · Updated 3 weeks ago
facebookresearch / GenEval2
Evaluation code and data for GenEval2
☆80 · Jan 8, 2026 · Updated 6 months ago
cheryyunl / ROVER
Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
☆27 · Dec 12, 2025 · Updated 7 months ago
PKU-YuanGroup / OSP-Next
OSP-Next
☆68 · Jun 22, 2026 · Updated last month
PKU-YuanGroup / N-LoRA
【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".
☆38 · Dec 5, 2024 · Updated last year
PKU-YuanGroup / ImgEdit
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
☆330 · Nov 5, 2025 · Updated 8 months ago
DAGroup-PKU / SpatialT2I
[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆86 · Mar 2, 2026 · Updated 4 months ago
PKU-YuanGroup / UAE
Official repository for the UAE paper, unified-GRPO, and unified-Bench
☆166 · Sep 12, 2025 · Updated 10 months ago
PKU-YuanGroup / AsFT
Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin".
☆37 · Jul 10, 2025 · Updated last year
PKU-YuanGroup / HoloTime
[ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
☆159 · Sep 4, 2025 · Updated 10 months ago
Purshow / Awesome-Unified-Multimodal
📖 This is a repository for organizing papers, code, and other resources related to unified multimodal models.
☆365 · Jan 8, 2026 · Updated 6 months ago
SuperFCR / E-4DGS
[🔥ACMMM 2025] Implementation of "E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras"
☆19 · Aug 14, 2025 · Updated 11 months ago
PKU-YuanGroup / Look-Back
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆100 · Jul 10, 2025 · Updated last year
Osilly / Interleaving-Reasoning-Generation
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100 · Jan 26, 2026 · Updated 6 months ago
arctanxarc / GENIUS
☆43 · May 9, 2026 · Updated 2 months ago
IDEA-XL / ChemCoTBench
LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry
☆55 · Oct 9, 2025 · Updated 9 months ago
PKU-YuanGroup / UniWorld
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
☆886 · Dec 23, 2025 · Updated 7 months ago
QC-LY / UiG
Code for "Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation"
☆15 · Nov 11, 2025 · Updated 8 months ago
nnnth / UniLIP
[ICLR 2026🔥] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
☆151 · Jan 26, 2026 · Updated 6 months ago
CaraJ7 / DraCo
Official Repository for Paper: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
☆17 · Dec 7, 2025 · Updated 7 months ago
Fr0zenCrane / UniCoT
[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆234 · May 31, 2026 · Updated last month
para-lost / ECHO
Echo: "Constantly Improving Image Models Need Constantly Improving Benchmarks" (ICLR 2026)
☆20 · Jan 29, 2026 · Updated 6 months ago
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆295 · Jan 24, 2026 · Updated 6 months ago
PKU-YuanGroup / OpenS2V-Nexus
[NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
☆223 · May 19, 2026 · Updated 2 months ago
PLUM-Lab / R2I-Bench
☆18 · Mar 14, 2026 · Updated 4 months ago
PKU-YuanGroup / LLMBind
LLMBind: A Unified Modality-Task Integration Framework
☆19 · Jun 16, 2024 · Updated 2 years ago
qiujihao19 / Artemis
[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos
☆27 · Apr 8, 2025 · Updated last year
HorizonWind2004 / reconstruction-alignment
[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…
☆411 · May 23, 2026 · Updated 2 months ago
HowardLi1984 / ECDFormer
【Nature Computational Science 2025🔥】Deep peak property learning for efficient chiral molecules ECD spectra prediction
☆51 · Jan 12, 2025 · Updated last year
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137 · Jan 30, 2026 · Updated 5 months ago
Tencent-Hunyuan / iFSQ
iFSQ & LlamaGen-REPA
☆103 · Jan 27, 2026 · Updated 6 months ago
PKU-YuanGroup / Next-Patch-Prediction
[AAAI26] Next Patch Prediction
☆129 · Jan 2, 2025 · Updated last year
Vchitect / Uni-MMMU
[ACL2026 oral] Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
☆26 · Apr 13, 2026 · Updated 3 months ago
wendell0218 / Janus-Pro-R1
[NeurIPS 2025] Official repository of the paper "Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Compreh…
☆23 · Sep 27, 2025 · Updated 10 months ago
micky-li-hd / CoCo
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
☆54 · Apr 9, 2026 · Updated 3 months ago
YuqingWang1029 / TokenBridge
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158 · Jul 24, 2025 · Updated last year
wyhlovecpp / GPT-Image-Edit
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆243 · Aug 15, 2025 · Updated 11 months ago