Cominclip/OmniVerifier

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cominclip/OmniVerifier)

Cominclip / OmniVerifier

[ICLR 2026 Oral & ICML 2026] Generative Universal Verifier as Multimodal Meta-Reasoner

☆64

Alternatives and similar repositories for OmniVerifier

Users that are interested in OmniVerifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qishisuren123 / AnyCap
View on GitHub
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆54Jul 24, 2025Updated last year
eternal8080 / MV-MATH
View on GitHub
Description for MV-MATH
☆15Jul 20, 2025Updated last year
cheryyunl / ROVER
View on GitHub
Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
☆27Dec 12, 2025Updated 7 months ago
zss02 / BiPS
View on GitHub
[CVPR 2026] See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
☆22Jun 28, 2026Updated last month
alchemistyzz / PeRL
View on GitHub
[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"
☆30Mar 30, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ShareLab-SII / UniAR
View on GitHub
[ICML 2026] The official implementation of paper "Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key …
☆46Jul 13, 2026Updated 2 weeks ago
zhengdian1 / AIA
View on GitHub
☆45Jan 4, 2026Updated 6 months ago
G-U-N / UniRL
View on GitHub
[ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models
☆91May 26, 2026Updated 2 months ago
wj-inf / MagicMirror
View on GitHub
Official impl. of "MagicMirror: A Large-Scale Dataset and Benchmark for Fine-Grained Artifacts Assessment in Text-to-Image Generation"
☆24Sep 15, 2025Updated 10 months ago
arctanxarc / GENIUS
View on GitHub
☆43May 9, 2026Updated 2 months ago
GongyeLiu / StyleCrafter-SDXL
View on GitHub
Code of StyleCrafter on SDXL
☆20Jun 25, 2024Updated 2 years ago
zsgvivo / VideoZoomer
View on GitHub
☆34Feb 12, 2026Updated 5 months ago
ali-vilab / DreamVideo-Omni
View on GitHub
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
☆16May 27, 2026Updated 2 months ago
QC-LY / UiG
View on GitHub
Code for "Understanding-in-Generation:Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation"
☆15Nov 11, 2025Updated 8 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jiyt17 / ReDiff
View on GitHub
Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'
☆45Jun 27, 2026Updated last month
UMass-Embodied-AGI / Mirage
View on GitHub
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆294Aug 2, 2025Updated 11 months ago
KaiyueSun98 / T2I-ReasonBench
View on GitHub
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆38Sep 16, 2025Updated 10 months ago
ZiyuGuo99 / Thinking-while-Generating
View on GitHub
The first Interleaved framework for textual reasoning within the visual generation process
☆165Mar 16, 2026Updated 4 months ago
PKU-YuanGroup / UniSandBox
View on GitHub
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
☆60Nov 27, 2025Updated 8 months ago
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
EndoluminalSurgicalVision-IMR / DGCI
View on GitHub
[MICCAI 2024] Implicit Representation Embraces Challenging Attributes of Pulmonary Airway Tree Structures
☆14Nov 13, 2024Updated last year
TencentARC / MindOmni
View on GitHub
[NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
☆139Oct 15, 2025Updated 9 months ago
casiatao / LPO
View on GitHub
The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.
☆19May 22, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Fr0zenCrane / UniCoT
View on GitHub
[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆234May 31, 2026Updated last month
LunarShen / DsicoVLA
View on GitHub
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22Jun 23, 2025Updated last year
showlab / UniRL
View on GitHub
The code repository of UniRL
☆53May 30, 2025Updated last year
shawn0728 / Unify-Agent
View on GitHub
🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.
☆86May 2, 2026Updated 2 months ago
OneIG-Bench / OneIG-Benchmark
View on GitHub
[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…
☆120Feb 10, 2026Updated 5 months ago
Zeqing-Wang / VideoVerse
View on GitHub
Official Repo for the VideoVerse
☆15Mar 29, 2026Updated 4 months ago
HorizonWind2004 / reconstruction-alignment
View on GitHub
[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…
☆411May 23, 2026Updated 2 months ago
WayneJin0918 / SRUM
View on GitHub
[ECCV 2026🔥] SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models
☆93Nov 26, 2025Updated 8 months ago
NOVAglow646 / Monet
View on GitHub
[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
☆215Mar 19, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mingluo-su / ROSE
View on GitHub
[CPAL 2026 oral] Offical implementation of "ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning”
☆16Apr 21, 2026Updated 3 months ago
SihengLi99 / LLM-Honesty-Survey
View on GitHub
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66Dec 8, 2024Updated last year
microsoft / PixelCraft
View on GitHub
[ICLR 2026] High-Fidelity Visual Reasoning on Structured Images
☆30Jul 17, 2026Updated last week
lavinal712 / Awesome-Visual-Tokenizers
View on GitHub
📖 This is a repository for organizing papers, codes and other resources related to visual tokenizers.
☆17Jul 7, 2026Updated 3 weeks ago
Osilly / Interleaving-Reasoning-Generation
View on GitHub
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100Jan 26, 2026Updated 6 months ago
yuli0103 / LayoutDiT
View on GitHub
LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer
☆49Jan 6, 2026Updated 6 months ago
AfterJourney00 / mmd_to_smpl
View on GitHub
An automated workflow for composing, rendering, and retargeting MMD assets.
☆16Feb 23, 2026Updated 5 months ago