MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay (HER) bitflip-DQN example. +prioritized replay.
☆19 · May 24, 2018 · Updated 7 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below.
- Improving coordinated (two intersections) transit signal priority on bus travel time and headway reliability with single agent reinforcem… ☆14 · Oct 2, 2021 · Updated 4 years ago
- First Prize Winner in HackOff-3.0 Siemens Healthineers Problem Statement number 3 on designing a Medical Chatbot. ☆22 · Jul 11, 2023 · Updated 2 years ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method) ☆13 · Apr 6, 2019 · Updated 6 years ago
- Code for an optimal velocity model (OVM) and a multiple car following (MCF) model ☆11 · Sep 14, 2018 · Updated 7 years ago
- ☆11 · Dec 23, 2024 · Updated last year
- introductory homework assignment to help get students set up for the rest of the course ☆17 · Jan 27, 2026 · Updated 2 months ago
- ☆12 · Jun 28, 2019 · Updated 6 years ago
- ☆12 · Dec 14, 2021 · Updated 4 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo… ☆12 · Nov 1, 2022 · Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆24 · Jun 13, 2019 · Updated 6 years ago
- Course webpage for MAT335 at the University of Toronto ☆14 · Apr 3, 2020 · Updated 5 years ago
- Online Resource Repository: Datasets, Simulation Platforms, and Empirical Research on Emerging Mixed Traffic of Automated Vehicles and Hu… ☆16 · Nov 29, 2023 · Updated 2 years ago
- Here is an implementation of some of a few results seen in Early Visual Concept Learning with Unsupervised Deep Learning ☆28 · Oct 2, 2016 · Updated 9 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro… ☆154 · Oct 25, 2021 · Updated 4 years ago
- STM32F103xB Firmware. CMSIS-DAP (USB HID) + 2 x High-speed UART (USB CDC) + UART/SLCAN + USB-I2C ☆12 · Feb 8, 2020 · Updated 6 years ago
- Exercises for Web3 MOOC ☆20 · May 14, 2020 · Updated 5 years ago
- Numpy implementation of Gaussian Process Regression ☆11 · May 27, 2019 · Updated 6 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms. ☆24 · May 3, 2019 · Updated 6 years ago
- Master's Degree final thesis project: reduce emergency vehicles travel time using V2V communications in VEINS simulator ☆11 · Jan 9, 2022 · Updated 4 years ago
- Deep learning chess engine, that has no idea about chess rules, but watches and learns ☆18 · Oct 24, 2017 · Updated 8 years ago
- USDX indicator calculates and displays the US dollar index in the separate window of any other chart. ☆11 · Aug 8, 2025 · Updated 7 months ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games ☆12 · Feb 15, 2018 · Updated 8 years ago
- Using very few experiments to efficiently learn an approximate model of an n-qubit quantum process. ☆16 · Apr 15, 2023 · Updated 2 years ago
- Creating a Yoga pose classification using Mediapipe with help of OpenCV ☆19 · Sep 13, 2022 · Updated 3 years ago
- A thorough, straightforward, un-intimidating introduction to Gaussian processes in NumPy. ☆16 · Jun 12, 2018 · Updated 7 years ago
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen… ☆13 · Oct 7, 2022 · Updated 3 years ago
- Forex Trend Finder App in React Native with Redux for Harvard CS50 Final Project ☆12 · Dec 9, 2022 · Updated 3 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video: ☆13 · May 21, 2022 · Updated 3 years ago
- ♕ A web based and Deep-Reinforcement-Learning-powered open source chess game. ☆17 · Feb 22, 2026 · Updated last month
- Program to import raw brainwaves, and using FFT and Frequency Index calculate various bands of brainwaves. ☆12 · Nov 7, 2016 · Updated 9 years ago
- AdvanceControl ☆11 · Dec 13, 2022 · Updated 3 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix… ☆12 · Jun 23, 2018 · Updated 7 years ago
- Python tool allowing easy book downloads from the terminal ☆12 · Mar 15, 2023 · Updated 3 years ago
- Neuroproc dataset descriptions and dictionaries ☆16 · Jan 2, 2017 · Updated 9 years ago
- Round Levels indicator to display round level zones and lines in MetaTrader 4 and MetaTrader 5 platforms. ☆12 · Jun 4, 2025 · Updated 9 months ago
- Expectimax AI for the game 2048 ☆16 · May 29, 2014 · Updated 11 years ago
- ☆12 · Jan 3, 2022 · Updated 4 years ago
- CS277 Project: Deep Reinforcement Learning in portfolio Management. This repo is the DQN part which implements a trading agent based on t… ☆13 · Jan 19, 2020 · Updated 6 years ago
- A pytorch implementation of "Latent Variable Dialogue Models and their Diversity" ☆18 · Nov 30, 2017 · Updated 8 years ago