Vero: An Open RL Recipe for General Visual Reasoning
β121Apr 19, 2026Updated last month
Alternatives and similar repositories for vero
Users that are interested in vero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25 π SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expertβ¦β16Feb 4, 2025Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decodingβ36Apr 25, 2026Updated 3 weeks ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learningβ36Aug 28, 2025Updated 8 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Modelβ37Nov 27, 2024Updated last year
- β14Apr 25, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR trainingβ54Dec 13, 2025Updated 5 months ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Modelsβ36Nov 3, 2024Updated last year
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)β33Oct 26, 2025Updated 6 months ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligenceβ109May 11, 2026Updated last week
- This is a framework for evaluating reasoning in foundational Video Models.β95May 5, 2026Updated 2 weeks ago
- β24May 23, 2025Updated 11 months ago
- Official repository for the UAE paper, unified-GRPO, and unified-Benchβ164Sep 12, 2025Updated 8 months ago
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)β18Aug 12, 2025Updated 9 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Modelsβ57Sep 29, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β53Aug 22, 2025Updated 8 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMsβ24May 7, 2025Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructionsβ32Oct 9, 2025Updated 7 months ago
- The official implementation of InfoRM [NeurIPS 2024].β15Oct 25, 2025Updated 6 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"β63Dec 26, 2025Updated 4 months ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.β13Feb 28, 2024Updated 2 years ago
- β19Jun 29, 2025Updated 10 months ago
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 5 months ago
- β128Jul 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"β63Mar 23, 2026Updated last month
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative APβ¦β14Jun 27, 2025Updated 10 months ago
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipeβ161Mar 30, 2026Updated last month
- PyTorch Implementation for InMaPβ12Oct 28, 2023Updated 2 years ago
- [ICMLβ25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training anβ¦β13Apr 17, 2025Updated last year
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025β17Nov 24, 2024Updated last year
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performancesβ20Jan 15, 2026Updated 4 months ago
- Spatial Aptitude Training for Multimodal Langauge Modelsβ31Feb 8, 2026Updated 3 months ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.β17Feb 15, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"β19Mar 10, 2025Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."β52Oct 19, 2024Updated last year
- [ICML2026] ARLArenaβ77May 2, 2026Updated 2 weeks ago
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/β¦β10Jun 21, 2023Updated 2 years ago
- SAM 2++: Tracking Anything at Any Granularityβ63Dec 15, 2025Updated 5 months ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fiβ¦β86Oct 29, 2023Updated 2 years ago
- Code for "Contrast then Memorize: Semantic Neighbor Retrieval-Enhanced Inductive Multimodal Knowledge Graph Completion", SIGIR 2024.β14Feb 20, 2025Updated last year