iglu-contest / gridworld
A reinforcement learning environment for the IGLU 2022 competition at NeurIPS
☆34Updated 2 years ago
Alternatives and similar repositories for gridworld
Users who are interested in gridworld are comparing it to the libraries listed below
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 3 years ago
- Knowledge-Aware RL agents with Commonsense Reasoning☆78Updated 3 years ago
- ☆56Updated 11 months ago
- ☆54Updated 4 years ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆53Updated last year
- ☆20Updated 3 years ago
- Implements the Messenger environment and EMMA model.☆25Updated 2 years ago
- ☆19Updated 2 years ago
- ☆37Updated last year
- Super fast implementations of common benchmark text world games☆51Updated 2 months ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Grounded SCAN data set.☆70Updated 3 years ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration.☆11Updated 4 years ago
- ☆57Updated 3 years ago
- Nethack Learning Environment Wrapper for Language Interface☆40Updated 2 years ago
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21☆10Updated 4 years ago
- Sandbox environment for generalizable agent research☆25Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆209Updated 2 years ago
- Phy-Q: A Testbed for Physical Reasoning☆45Updated last year
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration☆33Updated 4 years ago
- ☆17Updated last year
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- ☆28Updated 3 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆69Updated 2 years ago
- A framework for experimenting with never-ending learning☆79Updated last year
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆33Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆88Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆43Updated 11 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago