camel-ai / seta-envLinks
π» SETA: Scaling Environments for Terminal Agents - Environments
β90Updated this week
Alternatives and similar repositories for seta-env
Users that are interested in seta-env are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyondβ191Updated 7 months ago
- [ICLR 2026] Learning to Reason without External Rewardsβ391Updated 2 weeks ago
- SSRL: Self-Search Reinforcement Learningβ206Updated 5 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environmentsβ177Updated last month
- β388Updated 3 months ago
- PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hoursβ131Updated last week
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]β216Updated 2 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectβ¦β134Updated last week
- β229Updated 11 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scalingβ182Updated 6 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ125Updated 8 months ago
- RL Scaling and Test-Time Scaling (ICML'25)β113Updated last year
- Data Synthesis for Deep Research Based on Semi-Structured Dataβ198Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ261Updated 9 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]β180Updated 7 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"β256Updated last week
- Process Reward Models That Thinkβ78Updated 2 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"β68Updated 10 months ago
- β90Updated 3 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learningβ149Updated 4 months ago
- A repo for open research on building large reasoning modelsβ136Updated last week
- β135Updated 2 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoningβ96Updated 2 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolutionβ104Updated 4 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examplesβ120Updated last week
- The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Executionβ219Updated this week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"β102Updated 5 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike statβ¦β427Updated 2 weeks ago
- accompanying material for sleep-time compute paperβ119Updated 9 months ago
- [COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"β55Updated 4 months ago