allenai / codescientistLinks
CodeScientist: An automated scientific discovery system for code-based experiments
☆300Updated 4 months ago
Alternatives and similar repositories for codescientist
Users that are interested in codescientist are comparing it to the libraries listed below
Sorting:
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆233Updated last week
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆233Updated 8 months ago
- ☆271Updated 7 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆200Updated 2 months ago
- Together Open Deep Research☆352Updated 7 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆118Updated last week
- OpenResearcher, an advanced Scientific Research Assistant☆473Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆248Updated last month
- II-Researcher: a new open-source framework designed to aid building search / research agents☆477Updated 3 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆348Updated 4 months ago
- ☆355Updated 3 months ago
- ☆79Updated last month
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆230Updated 6 months ago
- Open AI data scientist agent that automates complex data analysis tasks using the ReAct framework. Execute Python code locally or in the …☆178Updated 4 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆488Updated this week
- Repository for Zochi's Research☆283Updated 2 months ago
- ☆172Updated 8 months ago
- ⚖️ Awesome LLM Judges ⚖️☆133Updated 6 months ago
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.☆730Updated 3 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆477Updated 2 months ago
- ☆180Updated last week
- An agent benchmark with tasks in a simulated software company.☆581Updated last month
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆122Updated 9 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆596Updated 5 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆266Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data☆410Updated last month
- ☆567Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆105Updated 7 months ago
- ☆182Updated 9 months ago