allenai / codescientistLinks
CodeScientist: An automated scientific discovery system for code-based experiments
☆289Updated 2 months ago
Alternatives and similar repositories for codescientist
Users that are interested in codescientist are comparing it to the libraries listed below
Sorting:
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆230Updated 6 months ago
- ☆266Updated 4 months ago
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆225Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆370Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆339Updated 2 months ago
- ☆76Updated 6 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆470Updated last month
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆172Updated 3 weeks ago
- Together Open Deep Research☆346Updated 5 months ago
- ☆350Updated last month
- ☆553Updated 4 months ago
- OpenResearcher, an advanced Scientific Research Assistant☆465Updated 11 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆585Updated 3 months ago
- An agent benchmark with tasks in a simulated software company.☆546Updated 3 weeks ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆223Updated 4 months ago
- Repository for Zochi's Research☆267Updated 3 weeks ago
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning☆384Updated last month
- Open AI data scientist agent that automates complex data analysis tasks using the ReAct framework. Execute Python code locally or in the …☆163Updated 2 months ago
- ☆171Updated 6 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆176Updated this week
- 🚀 MassGen: An Open-source Multi-Agent Scaling System Inspired by Grok Heavy and Gemini Deep Think. Join the discord channel: https://dis…☆436Updated this week
- An Open-Source AI Writing Project.☆366Updated last week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆391Updated last week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆458Updated last month
- OSS RL environment + evals toolkit☆161Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆127Updated 4 months ago
- AWM: Agent Workflow Memory☆316Updated 7 months ago
- The State Of The Art, intelligence☆152Updated last month
- Turn topics into essays in seconds!☆187Updated 2 months ago
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆429Updated 2 weeks ago