Implementation of TRPO and related algorithms
☆650 · May 20, 2018 · Updated 7 years ago
Alternatives and similar repositories for modular_rl
Users that are interested in modular_rl are comparing it to the libraries listed below.
- ☆99 · Aug 15, 2016 · Updated 9 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym. ☆3,058 · Jun 10, 2023 · Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆451 · Sep 13, 2018 · Updated 7 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆361 · Jun 2, 2020 · Updated 5 years ago
- ☆160 · Jul 21, 2017 · Updated 8 years ago
- Asynchronous Methods for Deep Reinforcement Learning ☆588 · Aug 9, 2018 · Updated 7 years ago
- Code for the paper "Generative Adversarial Imitation Learning" ☆730 · Nov 22, 2018 · Updated 7 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks" ☆347 · Nov 22, 2018 · Updated 7 years ago
- ☆18 · Apr 25, 2016 · Updated 10 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) ☆407 · Feb 25, 2017 · Updated 9 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016) ☆214 · Feb 16, 2018 · Updated 8 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper ☆551 · Mar 7, 2019 · Updated 7 years ago
- Guided Policy Search ☆601 · Feb 9, 2021 · Updated 5 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow ☆192 · Jul 20, 2018 · Updated 7 years ago
- TensorFlow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning" ☆1,007 · Mar 18, 2018 · Updated 8 years ago
- DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym. ☆2,170 · Apr 2, 2023 · Updated 3 years ago
- Testbed for deep reinforcement learning ☆162 · Jun 12, 2017 · Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆155 · Sep 22, 2017 · Updated 8 years ago
- Trust region policy optimization based on Gym and TensorFlow; can run in distributed mode ☆15 · May 6, 2017 · Updated 9 years ago
- Tensorforce: a TensorFlow library for applied reinforcement learning ☆3,308 · Jul 31, 2024 · Updated last year
- A starter agent that can solve a number of universe environments. ☆1,103 · Apr 7, 2018 · Updated 8 years ago
- Efficient Batched Reinforcement Learning in TensorFlow ☆976 · Jan 11, 2019 · Updated 7 years ago
- Implementations of deep RL papers and random experimentation ☆178 · Apr 7, 2018 · Updated 8 years ago
- Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" ☆1,629 · Oct 31, 2019 · Updated 6 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms ☆16,700 · Aug 1, 2024 · Updated last year
- Reinforcement Learning with Deep Energy-Based Policies ☆437 · Nov 28, 2023 · Updated 2 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning ☆1,481 · Dec 7, 2022 · Updated 3 years ago
- NIPS 2017 Value Prediction Network ☆168 · Jan 12, 2018 · Updated 8 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017) ☆1,124 · Oct 13, 2017 · Updated 8 years ago
- KEras Reinforcement Learning gYM agents ☆291 · Jul 8, 2017 · Updated 8 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor… ☆3,898 · May 29, 2022 · Updated 3 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp… ☆1,425 · Nov 29, 2023 · Updated 2 years ago
- ☆20 · Apr 27, 2016 · Updated 10 years ago
- A parallel version of Trust Region Policy Optimization ☆65 · Mar 6, 2017 · Updated 9 years ago
- Deep Reinforcement Learning for Keras. ☆5,554 · Sep 17, 2023 · Updated 2 years ago
- ChainerRL is a deep reinforcement learning library built on top of Chainer. ☆1,200 · Aug 10, 2021 · Updated 4 years ago
- Reinforcement learning with unsupervised auxiliary tasks ☆423 · Feb 13, 2019 · Updated 7 years ago
- Theano-based implementation of Deep Q-learning ☆1,095 · Apr 14, 2017 · Updated 9 years ago
- Value Iteration Networks ☆291 · Apr 21, 2017 · Updated 9 years ago