Danau5tin / Orca-Agent-RLLinks
Scaling Coding-Agent RL to 32x H100s. **Achieving 160% improvement** on Stanford's TerminalBench
☆91Updated 2 months ago
Alternatives and similar repositories for Orca-Agent-RL
Users that are interested in Orca-Agent-RL are comparing it to the libraries listed below
Sorting:
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆272Updated 3 months ago
- Data recipes and robust infrastructure for training AI agents☆84Updated this week
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆93Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆548Updated 2 weeks ago
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆40Updated 3 months ago
- Digital Red Queen: Adversarial Program Evolution in Core War with LLMs☆158Updated 2 weeks ago
- ☆159Updated last month
- Deep research agents using MiniMax M2.1 interleaved thinking☆194Updated last month
- Verifiers for LLM Reinforcement Learning☆81Updated 4 months ago
- OmniDaemon is a Universal Event-Driven Runtime for AI Agents, it's framework-agnostic, event-driven runtime that turns AI agents into pr…☆48Updated last month
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025]☆137Updated 2 months ago
- ☆95Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆88Updated 10 months ago
- ☆266Updated last week
- ☆43Updated 2 months ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆225Updated 3 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 5 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆251Updated 2 months ago
- The theory of mind module for the SWE agent☆68Updated 2 weeks ago
- An OpenSource Deep Research library with reasoning☆170Updated last month
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆249Updated 3 weeks ago
- ☆107Updated 2 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆115Updated last month
- DSPy module for OpenAI Codex SDK - signature-driven agentic workflows☆151Updated last month
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 6 months ago
- A method for steering llms to better follow instructions☆76Updated 5 months ago
- MCP-based Agent Deep Evaluation System☆142Updated 4 months ago
- [ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆324Updated this week
- The State Of The Art, intelligence☆157Updated 5 months ago