Vero: An Open RL Recipe for General Visual Reasoning
☆129Jun 19, 2026Updated last week
Alternatives and similar repositories for vero
Users that are interested in vero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- On Policy Distillation Build on top of Verl☆87May 25, 2026Updated last month
- ☆12Jul 4, 2024Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆38Apr 25, 2026Updated 2 months ago
- ☆17Jun 10, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 10 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- [CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"☆202Mar 19, 2026Updated 3 months ago
- ☆15Apr 25, 2025Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 6 months ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- ☆18Aug 1, 2024Updated last year
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆36Oct 26, 2025Updated 8 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆194Feb 24, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence☆117May 11, 2026Updated last month
- ☆24May 23, 2025Updated last year
- Official repository for the UAE paper, unified-GRPO, and unified-Bench☆165Sep 12, 2025Updated 9 months ago
- [ECCV 2026] Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training☆226Jun 19, 2026Updated last week
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆38Apr 7, 2025Updated last year
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆20Aug 12, 2025Updated 10 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆25May 7, 2025Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆30Oct 19, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆57Sep 29, 2025Updated 9 months ago
- The official implementation of InfoRM [NeurIPS 2024].☆16Oct 25, 2025Updated 8 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆63Dec 26, 2025Updated 6 months ago
- ☆19Jun 29, 2025Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 6 months ago
- Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"☆27Jun 28, 2023Updated 3 years ago
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆64Mar 23, 2026Updated 3 months ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated last year
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆163Mar 30, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆38Mar 23, 2023Updated 3 years ago
- ☆218Jun 15, 2026Updated 2 weeks ago
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated last year
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…☆462Apr 7, 2026Updated 2 months ago
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances☆20Jan 15, 2026Updated 5 months ago
- Multi-modal Sarcasm Detection and Humor Classification in Code-mixed Conversations☆13May 31, 2021Updated 5 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year