facebookresearch / aira-dojoLinks
AIRA-dojo: a framework for developing and evaluating AI research agents
☆124Updated last week
Alternatives and similar repositories for aira-dojo
Users that are interested in aira-dojo are comparing it to the libraries listed below
Sorting:
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆55Updated 6 months ago
- ☆88Updated 3 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Updated last month
- Ideas for projects related to Tinker☆147Updated 2 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆138Updated last month
- ☆112Updated last year
- Code for "Reasoning to Learn from Latent Thoughts"☆124Updated 10 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆175Updated 2 weeks ago
- ☆117Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 10 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆357Updated this week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆343Updated 2 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆114Updated 6 months ago
- ☆93Updated last week
- Replicating O1 inference-time scaling laws☆91Updated last year
- ☆33Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆152Updated 11 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆127Updated 3 months ago
- ☆99Updated last year
- ☆227Updated 11 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆184Updated 8 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- A Gym for Agentic LLMs☆437Updated last week
- ☆85Updated this week
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆226Updated this week
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆156Updated last month
- Training API and CLI☆323Updated this week
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆174Updated 4 months ago
- PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours☆118Updated last week
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Updated last year