MLP framework (pure NumPy) and DDQN framework for OpenAI Gym games. Also includes test code for PPO, a Hindsight Experience Replay (HER) bit-flip DQN example, and prioritized replay.
☆19May 24, 2018Updated 7 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below.
- Improving coordinated (two intersections) transit signal priority on bus travel time and headway reliability with single agent reinforcem…☆14Oct 2, 2021Updated 4 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24May 3, 2019Updated 7 years ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method)☆13Apr 6, 2019Updated 7 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- Code for an optimal velocity model (OVM) and a multiple car following (MCF) model☆11Sep 14, 2018Updated 7 years ago
- Introductory homework assignment to help get students set up for the rest of the course☆17Jan 27, 2026Updated 3 months ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo…☆12Nov 1, 2022Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆25Jun 13, 2019Updated 6 years ago
- ☆11Dec 23, 2024Updated last year
- Course webpage for MAT335 at the University of Toronto☆14Apr 3, 2020Updated 6 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Oct 4, 2020Updated 5 years ago
- A synthetic 24 hour traffic scenario for a 45 km section of the German highway A81 between Stuttgart Feuerbach - Heilbronn (Baden-Württem…☆12Oct 5, 2020Updated 5 years ago
- An implementation of a few results from Early Visual Concept Learning with Unsupervised Deep Learning☆28Oct 2, 2016Updated 9 years ago
- Reproductions of some research reports☆13Sep 11, 2018Updated 7 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆154Oct 25, 2021Updated 4 years ago
- STM32F103xB Firmware. CMSIS-DAP (USB HID) + 2 x High-speed UART (USB CDC) + UART/SLCAN + USB-I2C☆12Feb 8, 2020Updated 6 years ago
- CATS Lab ACC data is a car-following trajectory dataset covering both mixed traffic and pure AV traffic.☆11Jan 6, 2023Updated 3 years ago
- Exercises for Web3 MOOC☆20May 14, 2020Updated 5 years ago
- Numpy implementation of Gaussian Process Regression☆11May 27, 2019Updated 6 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 7 years ago
- Master's Degree final thesis project: reduce emergency vehicles travel time using V2V communications in VEINS simulator☆11Jan 9, 2022Updated 4 years ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games☆12Feb 15, 2018Updated 8 years ago
- code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"☆11May 2, 2024Updated 2 years ago
- ☆12Dec 21, 2018Updated 7 years ago
- PPO with Hindsight Experience Replay (HER)☆12May 8, 2018Updated 8 years ago
- Yoga pose classification using MediaPipe with the help of OpenCV☆20Sep 13, 2022Updated 3 years ago
- Powershell cmdlet for rendering image files to console☆34Mar 19, 2023Updated 3 years ago
- ☆33Oct 17, 2018Updated 7 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views.☆13May 21, 2022Updated 3 years ago
- AdvanceControl☆11Dec 13, 2022Updated 3 years ago
- Python tool allowing easy book downloads from the terminal☆12Mar 15, 2023Updated 3 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- Program that imports raw brainwaves and uses FFT and a Frequency Index to calculate various brainwave bands.☆12Nov 7, 2016Updated 9 years ago
- Using RL-controlled vehicles as traffic regulator to reduce the travel time of emergency vehicles near intersections☆11Jan 27, 2022Updated 4 years ago
- Expectimax AI for the game 2048☆16May 29, 2014Updated 11 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- Round Levels indicator to display round level zones and lines in MetaTrader 4, MetaTrader 5, and cTrader platforms.☆13Apr 20, 2026Updated 2 weeks ago
- CS277 Project: Deep Reinforcement Learning in portfolio Management. This repo is the DQN part which implements a trading agent based on t…☆14Jan 19, 2020Updated 6 years ago
- A pytorch implementation of "Latent Variable Dialogue Models and their Diversity"☆18Nov 30, 2017Updated 8 years ago