duckietown / gym-duckietown-agent
This is the template for the gym agent.
☆11Updated 6 years ago
Alternatives and similar repositories for gym-duckietown-agent:
Users that are interested in gym-duckietown-agent are comparing it to the libraries listed below
- Public accompanying repository for Universite de Montreal's IFT 6757: Autnonomous Vehicles, Fall 2019.☆11Updated 2 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Updated 5 years ago
- ☆13Updated 3 years ago
- Boiler plate code for Torch based ML projects☆10Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- Pytorch-based python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]☆13Updated 4 years ago
- Complementary material to EAAAI18 Paper "Mighty Thymio for Higher-Level Robotics Education"☆18Updated last year
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆10Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- ☆45Updated 5 years ago
- NeurIPS 2018: AI for Prosthetics Challenge – 3rd place solution☆32Updated 5 years ago
- Backprop training of recurrent neural networks with Hebbian plastic connections☆20Updated 3 years ago
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- Codebase for Efficient yet simple Reinforcement Learning Research Framework☆28Updated 2 years ago
- ☆17Updated 7 years ago
- A collection of notebooks aiding the understanding of machine-learning papers.☆10Updated 3 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆34Updated 2 years ago
- ☆29Updated 6 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆11Updated 3 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago
- Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library☆15Updated 6 years ago
- ☆42Updated 4 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Simple change of a3c to a2c