ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
☆359Dec 3, 2025Updated 5 months ago
Alternatives and similar repositories for ScienceWorld
Users that are interested in ScienceWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Super fast implementations of common benchmark text world games☆53Aug 25, 2025Updated 8 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆742Feb 8, 2026Updated 3 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆214Mar 10, 2025Updated last year
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆326Oct 22, 2024Updated last year
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.☆1,412Jan 30, 2026Updated 3 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆145Apr 11, 2024Updated 2 years ago
- [EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games☆73Feb 22, 2021Updated 5 years ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,471Nov 26, 2025Updated 5 months ago
- Knowledge-Aware RL agents with Commonsense Reasoning☆79Mar 4, 2022Updated 4 years ago
- ☆38Jul 17, 2024Updated last year
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆293Aug 3, 2023Updated 2 years ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆540Sep 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A learning environment for man-made Interactive Fiction games.☆324Mar 27, 2026Updated last month
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆415May 20, 2024Updated 2 years ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆149Nov 26, 2024Updated last year
- An extensible benchmark for evaluating large language models on planning☆466Sep 17, 2025Updated 8 months ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆163Oct 30, 2024Updated last year
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆94Feb 8, 2026Updated 3 months ago
- This repository contains the source code of the EMNLP 2020 paper Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehensio…☆20Oct 8, 2020Updated 5 years ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A library for advanced large language model reasoning☆2,343Jun 10, 2025Updated 11 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆168Dec 17, 2024Updated last year
- The core repository of the elsciRL framework.☆18Dec 8, 2025Updated 5 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆477Mar 19, 2024Updated 2 years ago
- ☆21Aug 18, 2024Updated last year
- Benchmarking the Spectrum of Agent Capabilities☆542Jan 23, 2024Updated 2 years ago
- ☆29Jun 5, 2025Updated 11 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆24Mar 18, 2025Updated last year
- Official Repo of LangSuitE☆84Aug 15, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- API to run VirtualHome, a Multi-Agent Household Simulator☆614Mar 26, 2026Updated last month
- Nethack Learning Environment Wrapper for Language Interface☆42Sep 11, 2023Updated 2 years ago
- ☆23Sep 2, 2024Updated last year
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆784Sep 11, 2025Updated 8 months ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge☆2,201Mar 18, 2024Updated 2 years ago
- The multi-modal sequence to sequence baseline neural models used in the Grounded SCAN paper.☆16Mar 21, 2021Updated 5 years ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 11 months ago