Danau5tin / Orca-Agent-RL
Scaling Coding-Agent RL to 32x H100s. **Achieving 160% improvement** on Stanford's TerminalBench
☆82Updated 3 weeks ago
Alternatives and similar repositories for Orca-Agent-RL
Users who are interested in Orca-Agent-RL are comparing it to the libraries listed below.
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆269Updated last month
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆486Updated last week
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆64Updated this week
- Verifiers for LLM Reinforcement Learning☆78Updated 2 months ago
- Demo for using copilotkit with the ada-middleware from ag-ui☆70Updated last month
- DSPy module for OpenAI Codex SDK - signature-driven agentic workflows☆138Updated 3 weeks ago
- An OpenSource Deep Research library with reasoning☆165Updated this week
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆166Updated 2 months ago
- ☆188Updated this week
- Anemoi: A Semi-Centralized Multi-agent System Based on Agent-to-Agent Communication MCP server from Coral Protocol☆369Updated 2 months ago
- How to build the best search, one step at a time!☆82Updated this week
- An Automatic Prompt Optimization Framework for Large Language Models☆137Updated 3 months ago
- An MCP server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 4 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆39Updated last month
- ☆107Updated 3 weeks ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using…☆130Updated 2 months ago
- MCP-based Agent Deep Evaluation System☆138Updated last month
- Context Engineering Course with DSPy☆202Updated 3 months ago
- VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution☆179Updated this week
- ☆84Updated 2 weeks ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆233Updated last week
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆268Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆82Updated 8 months ago
- ☆43Updated 2 weeks ago
- OmniDaemon is a Universal Event-Driven Runtime for AI Agents: a framework-agnostic, event-driven runtime that turns AI agents into pr…☆41Updated last week
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆148Updated last month
- Interactive command-line chat application powered by Langchain, Langgraph, Prompt Toolkit and Rich☆131Updated this week
- ☆35Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆449Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year