A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning agents!
☆663Sep 6, 2019Updated 6 years ago
Alternatives and similar repositories for pycolab
Users that are interested in pycolab are comparing it to the libraries listed below
Sorting:
- This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.☆629May 18, 2022Updated 3 years ago
- Simple and easily configurable grid world environments for reinforcement learning☆2,412Mar 2, 2026Updated 2 weeks ago
- A customisable 3D platform for agent-based AI research☆7,341Jan 4, 2023Updated 3 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition"☆833Apr 2, 2023Updated 2 years ago
- Efficient Batched Reinforcement Learning in TensorFlow☆975Jan 11, 2019Updated 7 years ago
- NIPS 2017 Value Prediction Network☆167Jan 12, 2018Updated 8 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,021Mar 13, 2019Updated 7 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,048Jun 10, 2023Updated 2 years ago
- Spriteworld: a flexible, configurable python-based reinforcement learning environment☆371Jun 1, 2020Updated 5 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,122Oct 13, 2017Updated 8 years ago
- bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent☆1,531Apr 13, 2024Updated last year
- TensorFlow Reinforcement Learning☆3,136Dec 8, 2022Updated 3 years ago
- DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.☆2,169Apr 2, 2023Updated 2 years ago
- A TensorFlow implementation of the Differentiable Neural Computer.☆2,528Jul 23, 2021Updated 4 years ago
- PlayGround: AI Research into Multi-Agent Learning.☆781Dec 19, 2023Updated 2 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,472Dec 7, 2022Updated 3 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆618Jul 6, 2023Updated 2 years ago
- Tensorforce: a TensorFlow library for applied reinforcement learning☆3,312Jul 31, 2024Updated last year
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 2 years ago
- An End-To-End, Lightweight and Flexible Platform for Game Research☆2,093Aug 30, 2021Updated 4 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆94Apr 17, 2018Updated 7 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch☆3,418Apr 16, 2024Updated last year
- ☆120Jul 9, 2020Updated 5 years ago
- Tensorflow implementation of "The Predictron: End-To-End Learning and Planning"☆291Jan 20, 2017Updated 9 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- A customizable framework to create maze and gridworld environments☆269Apr 5, 2019Updated 6 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆721May 12, 2024Updated last year
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,668Aug 1, 2024Updated last year
- Deep Reinforcement Learning with pytorch & visdom☆804Jul 16, 2020Updated 5 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- A customisable 2D platform for agent-based AI research☆439Oct 5, 2023Updated 2 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,882May 29, 2022Updated 3 years ago
- Reinforcement Learning in PyTorch☆2,274Jan 4, 2021Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.☆10,854Nov 4, 2024Updated last year
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆186Nov 1, 2017Updated 8 years ago