AIRA-dojo: a framework for developing and evaluating AI research agents
☆132Mar 12, 2026Updated last week
Alternatives and similar repositories for aira-dojo
Users that are interested in aira-dojo are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆30Dec 15, 2025Updated 3 months ago
- A Model Context Protocol (MCP) server that provides access to the DBLP computer science bibliography database for Large Language Models.☆25Dec 31, 2025Updated 2 months ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Jul 4, 2022Updated 3 years ago
- Implementation of SOAR☆51Sep 17, 2025Updated 6 months ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- ☆55Updated this week
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 7 months ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- LLM as World Models using Bayesian inference☆16May 27, 2025Updated 9 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆35Jun 28, 2024Updated last year
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆591Aug 10, 2025Updated 7 months ago
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆26Feb 5, 2026Updated last month
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆22Mar 6, 2026Updated 2 weeks ago
- Molecular Set Representation Learning☆51Jul 16, 2025Updated 8 months ago
- ☆23Apr 5, 2023Updated 2 years ago
- This repo contains all the codes for SEScore implementation☆15Mar 3, 2025Updated last year
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆27Feb 17, 2026Updated last month
- Discovering Data-driven Hypotheses in the Wild☆136Jun 9, 2025Updated 9 months ago
- Examples of robotic manipulation using DeepMind's MuJoCo framework.☆14Aug 13, 2024Updated last year
- Code for the "Evolving Reservoirs for Meta Reinforcement Learning" paper☆11Apr 22, 2024Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆31Apr 1, 2025Updated 11 months ago
- ☆91Oct 30, 2025Updated 4 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 11 months ago
- ☆17Mar 3, 2025Updated last year
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- ☆33Jul 9, 2025Updated 8 months ago
- Deep universal probabilistic programming with Python and PyTorch☆12Apr 1, 2020Updated 5 years ago
- code for BINOCULARS and Multi-Step BO☆12Dec 7, 2020Updated 5 years ago
- ☆11Mar 17, 2024Updated 2 years ago
- Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".☆17Feb 3, 2023Updated 3 years ago
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆34Aug 23, 2025Updated 6 months ago
- ☆12May 19, 2023Updated 2 years ago
- [NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents☆73Mar 10, 2026Updated last week
- Goal-conditioned reinforcement learning like 🔥☆14Feb 3, 2024Updated 2 years ago
- PreferenceNet: Encoding Human Preferences in Auction Design With Deep Learning☆17Aug 10, 2021Updated 4 years ago