open-thoughts/OpenThoughts-Agent

Data recipes and robust infrastructure for training AI agents

☆260

Alternatives and similar repositories for OpenThoughts-Agent

Users who are interested in OpenThoughts-Agent are comparing it to the libraries listed below.

kanishkg / endless-terminals
☆134 · Mar 31, 2026 · Updated 3 months ago
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,085 · Updated this week
abundant-ai / SWE-gen
Convert GitHub PRs into Harbor tasks
☆72 · Jul 13, 2026 · Updated last week
harbor-framework / terminal-bench-3
Measuring agents' ability to get work done on a computer
☆329 · Updated this week
hamishivi / tmax
Training terminal-agents
☆238 · Updated this week
harbor-framework / harbor
Framework for evaluating and improving agents
☆3,348 · Updated this week
hkust-nlp / Toolathlon
[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
☆432 · Updated this week
OpenHands / codescout
Repo for collaboration on OSS agentic code search
☆70 · Apr 29, 2026 · Updated 2 months ago
Danau5tin / terminal-bench-rl
GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…
☆396 · Aug 24, 2025 · Updated 10 months ago
open-thoughts / OpenThoughts-TBLite
A Difficulty-Calibrated Benchmark for Building Terminal Agents
☆27 · Feb 20, 2026 · Updated 5 months ago
harbor-framework / terminal-bench
A benchmark for LLMs on complicated tasks in the terminal
☆2,472 · Jul 11, 2026 · Updated last week
Danau5tin / tbench-agentic-data-pipeline
Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training
☆70 · Jul 28, 2025 · Updated 11 months ago
cmu-l3 / gym-anything
Gym-Anything: Turn any Software into an Agent Environment
☆263 · Jul 14, 2026 · Updated last week
harbor-framework / terminal-bench-challenges
☆18 · Jun 18, 2026 · Updated last month
NVIDIA-NeMo / ProRL-Agent-Server
Agentic RL on Any Harness at Scale
☆696 · Jul 15, 2026 · Updated last week
ltzheng / SimpleTIR
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401 · Mar 30, 2026 · Updated 3 months ago
Factory-AI / legacy-bench
Legacy-Bench: A benchmark for evaluating AI agents on legacy software engineering tasks
☆18 · Apr 2, 2026 · Updated 3 months ago
zlab-princeton / llm-distillation-jax
JAX implementation of configurable LLM distillation training
☆24 · Nov 15, 2025 · Updated 8 months ago
PrimeIntellect-ai / lab-cookbook
Lab Cookbook
☆37 · Updated this week
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127 · May 6, 2025 · Updated last year
scaleapi / SWE-Atlas
open source SWE-Atlas
☆55 · Updated this week
LARK-AI-Lab / EnvFactory
The official paper for EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL.
☆85 · Jun 5, 2026 · Updated last month
mlfoundations / evalchemy
Automatic evals for LLMs
☆600 · Feb 24, 2026 · Updated 4 months ago
LAION-AI / scaling-laws-for-comparison
☆22 · May 12, 2026 · Updated 2 months ago
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,569 · Updated this week
R2E-Gym / R2E-Gym
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
☆308 · Jul 13, 2025 · Updated last year
aisa-group / PostTrainBench
Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours
☆463 · Updated this week
lili-chen / rltf
Reinforcement Learning from Text Feedback
☆49 · Feb 17, 2026 · Updated 5 months ago
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use [TMLR 2026]
☆1,021 · Updated this week
harbor-framework / terminal-bench-2
☆340 · Apr 30, 2026 · Updated 2 months ago
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,017 · Jul 1, 2026 · Updated 2 weeks ago
xlang-ai / CUA-Gym
Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents
☆179 · May 26, 2026 · Updated last month
Gen-Verse / GenEnv
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators
☆62 · Dec 23, 2025 · Updated 6 months ago
eigent-ai / toolathlon_gym
Toolathlon-Gym for testing AI agents' real-world tool-use capabilities across diverse MCP servers.
☆139 · Apr 2, 2026 · Updated 3 months ago
snap-research / CoSearch
CoSearch: Joint Training of Reasoning and Document Ranking via Reinforcement Learning for Agentic Search
☆15 · Apr 28, 2026 · Updated 2 months ago
camel-ai / seta
💻 SETA: Scaling Environments for Terminal Agents
☆125 · Updated this week
SWE-bench / SWE-smith
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆710 · Updated this week
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆228 · Nov 27, 2025 · Updated 7 months ago
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆708 · Jul 29, 2025 · Updated 11 months ago