[ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"
☆51Jan 30, 2026Updated last month
Alternatives and similar repositories for VPPO-RL
Users who are interested in VPPO-RL are comparing it to the libraries listed below.
- [CVPR 2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆79Feb 27, 2026Updated last month
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding☆45Mar 16, 2026Updated last week
- AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management☆25Mar 17, 2026Updated last week
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆41Oct 9, 2025Updated 5 months ago
- ☆17Aug 7, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆55Mar 17, 2026Updated last week
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆30Jan 12, 2026Updated 2 months ago
- Violent behavior detection based on YOLOv5s☆18Jul 2, 2024Updated last year
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆39Nov 26, 2025Updated 4 months ago
- ☆21Dec 14, 2024Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆44Aug 7, 2025Updated 7 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 4 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 3 months ago
- [CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents☆27Mar 9, 2026Updated 2 weeks ago
- Python codes for mathematical modeling.☆12Sep 5, 2021Updated 4 years ago
- ☆12Aug 8, 2024Updated last year
- Support finetuning GLM4v with zero2☆16Jun 29, 2024Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆27Feb 10, 2026Updated last month
- ☆11Sep 19, 2025Updated 6 months ago
- ☆29Oct 8, 2025Updated 5 months ago
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆46Mar 16, 2026Updated last week
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated 2 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- Official implementation of "PathReasoner-R1: Instilling Structured Reasoning into Pathology Vision-Language Model via Knowledge-Guided Po…☆22Jan 28, 2026Updated 2 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Mixture of Lora Experts☆10Apr 7, 2024Updated last year
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆63Feb 28, 2026Updated last month
- ☆14Jul 17, 2025Updated 8 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Medical Imaging Benchmarks for Out-Of-Distribution Detection☆45Mar 19, 2026Updated last week
- ☆57Dec 23, 2025Updated 3 months ago
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by NeurIPS 2024)☆14Jan 7, 2025Updated last year
- Code release for VTW (AAAI 2025 Oral)☆66Nov 4, 2025Updated 4 months ago
- [BMVC 2022] Information Theoretic Representation Distillation☆19Oct 6, 2023Updated 2 years ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆285Mar 21, 2026Updated last week
- ☆15May 6, 2021Updated 4 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago