Agentic RL on Any Harness at Scale
☆558Jun 13, 2026Updated this week
Alternatives and similar repositories for ProRL-Agent-Server
Users who are interested in ProRL-Agent-Server are comparing it to the libraries listed below.
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆44Mar 31, 2025Updated last year
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆32Dec 9, 2025Updated 6 months ago
- ☆39Jan 9, 2026Updated 5 months ago
- ☆27Oct 10, 2024Updated last year
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 3 months ago
- Live evaluation of trading agents☆159Feb 17, 2026Updated 4 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆26Apr 26, 2026Updated last month
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 7 months ago
- Test-time Scaling for VAR models☆31Sep 19, 2025Updated 9 months ago
- Hands-On Image Processing with Python, Second Edition, Published by Packt☆30Updated this week
- ☆31Sep 12, 2025Updated 9 months ago
- Code for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆23Feb 16, 2025Updated last year
- An MCP Task Server☆11Mar 7, 2025Updated last year
- Computer Environments Elicit General Agentic Intelligence in LLMs☆235May 29, 2026Updated 2 weeks ago
- ☆34Jul 15, 2025Updated 11 months ago
- ☆22Jul 23, 2025Updated 10 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆60Feb 6, 2026Updated 4 months ago
- [CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆92Jun 5, 2026Updated last week
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Cluster Document for IIL@HIT☆20Apr 5, 2023Updated 3 years ago
- ☆64Mar 30, 2026Updated 2 months ago
- CLI for fetching LangSmith data☆118May 26, 2026Updated 3 weeks ago
- A modern X11 server written from scratch in Rust.☆342Updated this week
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆27Dec 21, 2025Updated 5 months ago
- Repository containing code for visualizing Ant Colony Optimization algorithms for clustering☆21Jun 23, 2015Updated 10 years ago
- ☆32Jun 12, 2025Updated last year
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- Give Claude Code a cheap coworker. CLI tools that delegate bulk I/O to cheap LLMs (Kimi, DeepSeek, Ollama). Save 60-70% of your token bud…☆152Jun 10, 2026Updated last week
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models"☆113Aug 15, 2025Updated 10 months ago
- ☆65Jul 11, 2025Updated 11 months ago
- MCP server for the Delinea Secret Server and Platform APIs☆46Jun 2, 2026Updated 2 weeks ago
- ☆15Apr 8, 2024Updated 2 years ago
- Interactive Social Media Simulation of Believable Human Proxies☆14Dec 23, 2025Updated 5 months ago
- ☆10Jan 20, 2024Updated 2 years ago
- ☆33Oct 21, 2025Updated 7 months ago
- ☆74Mar 3, 2026Updated 3 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆68Apr 3, 2026Updated 2 months ago