Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal
☆76May 14, 2026Updated last week
Alternatives and similar repositories for terminal-bench-science
Users that are interested in terminal-bench-science are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- READMEs for RxRx.ai datasets☆28Jun 6, 2025Updated 11 months ago
- A fun multiplayer game☆10Jan 5, 2019Updated 7 years ago
- Kinase-Substrate Enrichment Analysis (KSEA) App☆10May 2, 2017Updated 9 years ago
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆63Jul 28, 2025Updated 9 months ago
- Folk dance event information.☆10Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- R function to plot high quality, elegant heatmap using 'ggplot2' graphics . Some of the important features of this package are, colorin…☆11Apr 19, 2016Updated 10 years ago
- [AAAI'26] Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression☆19Dec 21, 2025Updated 5 months ago
- RapidSMS Twilio Backend☆22Aug 24, 2018Updated 7 years ago
- Synthetic coordinates for GNNs, as proposed in "Directional Message Passing on Molecular Graphs via Synthetic Coordinates" (NeurIPS 2021)☆32Apr 26, 2023Updated 3 years ago
- Prediction of compound synthesis accessibility bashed on reaction knowledge graph☆17May 24, 2022Updated 3 years ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- Self-Supervised Representation Learning (CDpath ICCV 2021)☆13Dec 1, 2022Updated 3 years ago
- Repository of the package Fast Wave☆12May 26, 2025Updated 11 months ago
- ☆14Jul 13, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PlaNet: Predicting population response to drugs via clinical knowledge graph☆32Mar 13, 2025Updated last year
- Sequence to Medical Phenotypes: A pipeline featuring variant annotation, prioritization, pharmacogenomics, and tools for analyzing genomi…☆13Mar 1, 2016Updated 10 years ago
- Madrigal: Multimodal AI predicts clinical outcomes of drug combinations from preclinical data☆41Jul 31, 2025Updated 9 months ago
- Data and analysis scripts for understanding molecular entropies, including conformer flexibility☆13Mar 29, 2021Updated 5 years ago
- ALOFT, the Annotation Of Loss-of-Function Transcripts, provides extensive functional annotations to loss-of-function variants in the hum…☆19Nov 4, 2019Updated 6 years ago
- ☆19Nov 7, 2022Updated 3 years ago
- This repo contains the results data for Round 2 of EGFR Protein Design Competition.☆19Mar 18, 2025Updated last year
- VarGen is an R package designed to get a list of variants related to a disease. It just need an OMIM morbid ID as input and optionally a…☆17Jan 28, 2024Updated 2 years ago
- [ACL 2026] Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration☆22Apr 11, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A template to create your own literature survey engine☆14Updated this week
- Automatically generated and up-to-date datasets for Cobalt.☆10May 16, 2020Updated 6 years ago
- The theory of mind module for the SWE agent☆104May 12, 2026Updated last week
- Interface package for featurizing atomic structures☆42Nov 3, 2023Updated 2 years ago
- Various tools for electoral analysis in Poland☆14Jul 23, 2025Updated 10 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆29Mar 1, 2025Updated last year
- Steering LLM Thinking with Budget Guidance☆30Feb 19, 2026Updated 3 months ago
- ODSC 2023 workshop materials on causal graphs using implementations of DoWhy (PyWhy, EconML)☆13Nov 1, 2023Updated 2 years ago
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆19Dec 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AI Chatbot with your MkDocs Documentation☆14Mar 9, 2026Updated 2 months ago
- Command line image downloader from Google☆18Mar 7, 2023Updated 3 years ago
- ☆29Aug 26, 2025Updated 8 months ago
- Zeobuilder is an extensible GUI-toolkit for molecular model construction.☆13Feb 15, 2019Updated 7 years ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆39Oct 8, 2025Updated 7 months ago
- ☆15Aug 5, 2022Updated 3 years ago
- ☆18Nov 30, 2025Updated 5 months ago