cheryyunl/ROVER

Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

☆26

Alternatives and similar repositories for ROVER

Users interested in ROVER are comparing it to the repositories listed below.

sen-ye / R3
[ICLR26] Understanding VS. Generation: Navigating Optimization Dilemma in Multimodal Models
☆25 · May 6, 2026 · Updated 2 months ago
Vchitect / Uni-MMMU
[ACL2026 oral] Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
☆25 · Apr 13, 2026 · Updated 3 months ago
tianyi-lab / TSRBench
[ICML 2026] TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models
☆25 · Mar 24, 2026 · Updated 3 months ago
WayneJin0918 / SRUM
[ECCV 2026🔥] SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models
☆93 · Nov 26, 2025 · Updated 7 months ago
Cominclip / OmniVerifier
[ICLR 2026 Oral & ICML 2026] Generative Universal Verifier as Multimodal Meta-Reasoner
☆64 · May 29, 2026 · Updated last month
Fr0zenCrane / UniCoT
[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆233 · May 31, 2026 · Updated last month
jiahao-shao1 / openclaw-setup
☆16 · Mar 8, 2026 · Updated 4 months ago
liy1shu / FlowBotHD
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
☆13 · Dec 13, 2024 · Updated last year
PKU-YuanGroup / WISE
[ICML 2026🔥] WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆212 · Jun 26, 2026 · Updated 3 weeks ago
jiahao-shao1 / notion-lifeos-skill
Notion LifeOS PARA system: agent skill for Claude Code, OpenClaw, Codex and more
☆23 · Mar 24, 2026 · Updated 3 months ago
AIFrontierLab / UniGame
[CVPR'26] UniGame code implementation
☆20 · Apr 21, 2026 · Updated 2 months ago
QC-LY / UiG
Code for "Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation"
☆15 · Nov 11, 2025 · Updated 8 months ago
nssmd / UniG2U
☆23 · Apr 2, 2026 · Updated 3 months ago
HorizonWind2004 / reconstruction-alignment
[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…
☆410 · May 23, 2026 · Updated last month
SampsonML / DiscoverPhysics
☆16 · May 31, 2026 · Updated last month
Tinaliu0123 / speculative-verdict
Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation (ICLR 2026)
☆21 · Apr 27, 2026 · Updated 2 months ago
Espere-1119-Song / Video-MMLU
A Massive Multi-Discipline Lecture Understanding Benchmark
☆34 · Apr 20, 2026 · Updated 3 months ago
thuml / MiniVeo3-Reasoner
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…
☆229 · Apr 13, 2026 · Updated 3 months ago
kolmogorovArnoldFourierNetwork / kaf_act
PyTorch implementation of a learnable activation function combining base activation and Random Fourier Features (RFF). This package provi…
☆13 · Feb 2, 2025 · Updated last year
sherwinbahmani / threed_front_rendering
☆13 · Sep 2, 2023 · Updated 2 years ago
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137 · Jan 30, 2026 · Updated 5 months ago
PhoenixZ810 / RISEBench
[NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
☆154 · May 18, 2026 · Updated 2 months ago
arctanxarc / GENIUS
☆42 · May 9, 2026 · Updated 2 months ago
AIFrontierLab / TorchUMM
A unified multimodal model toolkit
☆270 · Jul 2, 2026 · Updated 2 weeks ago
Osilly / Interleaving-Reasoning-Generation
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100 · Jan 26, 2026 · Updated 5 months ago
KaiyueSun98 / T2I-ReasonBench
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆37 · Sep 16, 2025 · Updated 10 months ago
Yu-Fangxu / EACL
[Findings of NAACL 2024] Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation
☆41 · Nov 23, 2024 · Updated last year
technion-cs-nlp / vlm-circuits-analysis
Code for the experiments and websites of the paper "Same Task, Different Circuits"
☆36 · Jun 9, 2026 · Updated last month
renyu2002 / SJTU_SE_CG
Code repository for the undergraduate computer graphics course at the School of Software, Shanghai Jiao Tong University
☆14 · Oct 3, 2025 · Updated 9 months ago
yanghlll / ScalingNoise
☆41 · Mar 26, 2025 · Updated last year
Owen718 / LongPrompt-LLamaGen
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30 · Oct 21, 2024 · Updated last year
ZiyuGuo99 / Thinking-while-Generating
The first Interleaved framework for textual reasoning within the visual generation process
☆164 · Mar 16, 2026 · Updated 4 months ago
showlab / UniRL
The code repository of UniRL
☆53 · May 30, 2025 · Updated last year
cheryyunl / Make-An-Agent
☆51 · Jul 22, 2024 · Updated last year
NVlabs / SRSA
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
☆19 · Mar 25, 2026 · Updated 3 months ago
Gen-Verse / WideRange4D
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes
☆111 · Mar 19, 2025 · Updated last year
3dlg-hcvc / s2o
☆26 · Feb 11, 2026 · Updated 5 months ago
compling-wat / vlm-lens
[EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.
☆122 · Apr 25, 2026 · Updated 2 months ago
luccachiang / robots-pretrain-robots
[ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…
☆98 · Jan 22, 2025 · Updated last year