Vero: An Open RL Recipe for General Visual Reasoning
☆122 · Jun 3, 2026 · Updated this week
Alternatives and similar repositories for vero
Users who are interested in vero are comparing it to the repositories listed below.
- [NAACL'25 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert… ☆16 · Feb 4, 2025 · Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆36 · Aug 28, 2025 · Updated 9 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆37 · Nov 27, 2024 · Updated last year
- [CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language" ☆191 · Mar 19, 2026 · Updated 2 months ago
- ☆14 · Apr 25, 2025 · Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆53 · Dec 13, 2025 · Updated 5 months ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models ☆36 · Nov 3, 2024 · Updated last year
- ☆18 · Aug 1, 2024 · Updated last year
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25) ☆34 · Oct 26, 2025 · Updated 7 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ☆193 · Feb 24, 2026 · Updated 3 months ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence ☆112 · May 11, 2026 · Updated 3 weeks ago
- ☆24 · May 23, 2025 · Updated last year
- Official repository for the UAE paper, unified-GRPO, and unified-Bench ☆165 · Sep 12, 2025 · Updated 8 months ago
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training ☆188 · May 5, 2026 · Updated last month
- A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model. ☆38 · Apr 7, 2025 · Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆32 · Oct 9, 2025 · Updated 8 months ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding. ☆29 · Oct 19, 2025 · Updated 7 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆58 · Sep 29, 2025 · Updated 8 months ago
- The official implementation of InfoRM [NeurIPS 2024]. ☆15 · Oct 25, 2025 · Updated 7 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding" ☆63 · Dec 26, 2025 · Updated 5 months ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024. ☆13 · Feb 28, 2024 · Updated 2 years ago
- ☆19 · Jun 29, 2025 · Updated 11 months ago
- Inverse Scaling in Test-Time Compute ☆25 · Dec 3, 2025 · Updated 6 months ago
- Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression" ☆27 · Jun 28, 2023 · Updated 2 years ago
- ☆128 · Jul 29, 2024 · Updated last year
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP… ☆14 · Jun 27, 2025 · Updated 11 months ago
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe ☆162 · Mar 30, 2026 · Updated 2 months ago
- ☆24 · May 30, 2024 · Updated 2 years ago
- PyTorch Implementation for InMaP ☆12 · Oct 28, 2023 · Updated 2 years ago
- multimodal anomaly detection ☆14 · Jan 17, 2021 · Updated 5 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts" ☆10 · Jul 1, 2024 · Updated last year
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025 ☆18 · Nov 24, 2024 · Updated last year
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal… ☆452 · Apr 7, 2026 · Updated 2 months ago
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances ☆20 · Jan 15, 2026 · Updated 4 months ago
- Spatial Aptitude Training for Multimodal Language Models ☆33 · Feb 8, 2026 · Updated 4 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆19 · Mar 10, 2025 · Updated last year
- [ICML2026] ARLArena ☆79 · May 2, 2026 · Updated last month
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆53 · Oct 19, 2024 · Updated last year
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/… ☆10 · Jun 21, 2023 · Updated 2 years ago