FoundationAgents / AutoEnv
Scaling Agentic Environments Automatically.
☆47Updated last week
Alternatives and similar repositories for AutoEnv
Users who are interested in AutoEnv are comparing it to the libraries listed below.
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- ☆46Updated 3 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆290Updated 2 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆148Updated 4 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆120Updated 2 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆102Updated 4 months ago
- ☆223Updated 3 weeks ago
- MemEvolve & EvolveLab☆148Updated last month
- Scaling Long-Horizon LLM Agent via Context-Folding☆101Updated this week
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 7 months ago
- ☆144Updated 8 months ago
- The official GitHub page for the survey paper "A Survey on LLM Symbolic Reasoning", which is currently under review.☆22Updated last week
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆138Updated this week
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 7 months ago
- ☆108Updated last month
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆26Updated last week
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems☆73Updated 7 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆27Updated 2 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆64Updated 2 weeks ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆57Updated this week
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47Updated 8 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆304Updated 3 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆64Updated 3 months ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆72Updated 2 months ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆49Updated 3 weeks ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆124Updated 7 months ago
- ☆132Updated 3 weeks ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120Updated 8 months ago
- ☆177Updated last month
- ☆186Updated 3 months ago