Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."
☆65Feb 25, 2026Updated 3 months ago
Alternatives and similar repositories for PyVision-RL
Users that are interested in PyVision-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding☆47Feb 28, 2026Updated 3 months ago
- ☆17Sep 11, 2025Updated 8 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆34Updated this week
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆69Mar 17, 2026Updated 2 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆63Mar 25, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official PyTorch implementation of the paper Transformer-Based Image Generation from Scene Graphs https://arxiv.org/abs/2303.04634☆19Jan 30, 2024Updated 2 years ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".☆55Oct 21, 2025Updated 7 months ago
- ☆15Oct 12, 2024Updated last year
- ☆24Jan 24, 2026Updated 4 months ago
- 聚焦海量面经检索、简历分析与模拟面试的 AI 求职准备平台☆139Mar 30, 2026Updated 2 months ago
- ☆11Mar 11, 2025Updated last year
- ☆27Feb 3, 2026Updated 4 months ago
- ☆42Jun 9, 2025Updated 11 months ago
- Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"☆217May 21, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆42Jan 27, 2026Updated 4 months ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆81Apr 7, 2026Updated 2 months ago
- Official repository of SoftREPA: Aligning Text to Image in Diffusion Models is Easier Than You Think☆23Jun 5, 2025Updated last year
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆32Mar 17, 2026Updated 2 months ago
- Codebase for EnterpriseOps-Gym from ServiceNow☆93May 30, 2026Updated last week
- InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models☆108Apr 20, 2026Updated last month
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆21Dec 14, 2025Updated 5 months ago
- ☆18May 18, 2026Updated 3 weeks ago
- ☆10Dec 3, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Sep 19, 2025Updated 8 months ago
- ☆31Oct 8, 2025Updated 8 months ago
- CVPR 2026 (Highlight)-Guiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)☆72Apr 9, 2026Updated 2 months ago
- [CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆82May 12, 2026Updated 3 weeks ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.☆81May 2, 2026Updated last month
- [ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models☆71May 15, 2025Updated last year
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆79Feb 9, 2026Updated 3 months ago
- ☆13Jul 3, 2024Updated last year
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆14Jul 17, 2025Updated 10 months ago
- [ACL 2026 Main] Revisit What You See: Revealing Visual Semantics in Vision Tokens to Guide LVLM Decoding☆25Nov 21, 2025Updated 6 months ago
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 5 months ago
- ☆12May 15, 2025Updated last year
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆67Updated this week
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- ☆50Jul 19, 2025Updated 10 months ago