MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.
☆19May 24, 2018Updated 7 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving coordinated (two intersections) transit signal priority on bus travel time and headway reliability with single agent reinforcem…☆14Oct 2, 2021Updated 4 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24May 3, 2019Updated 6 years ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method)☆13Apr 6, 2019Updated 6 years ago
- python, ccxt, backtrader, dash☆10Apr 20, 2018Updated 7 years ago
- ☆11Dec 23, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Curiosity-driven Exploration by Self-supervised Prediction☆24Jun 13, 2019Updated 6 years ago
- ☆13Jan 16, 2018Updated 8 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Oct 4, 2020Updated 5 years ago
- Here is an implementation of some of a few results seen in Early Visual Concept Learning with Unsupervised Deep Learning☆28Oct 2, 2016Updated 9 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆154Oct 25, 2021Updated 4 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- USDX indicator calculates and displays the US dollar index in the separate window of any other chart.☆11Aug 8, 2025Updated 7 months ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games☆12Feb 15, 2018Updated 8 years ago
- ☆12Dec 21, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PPO with Hindsight Experience Replay (HER)☆11May 8, 2018Updated 7 years ago
- A thorough, straightforward, un-intimidating introduction to Gaussian processes in NumPy.☆16Jun 12, 2018Updated 7 years ago
- Forex Trend Finder App in React Native with Redux for Harvard CS50 Final Project☆12Dec 9, 2022Updated 3 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:☆13May 21, 2022Updated 3 years ago
- Program to import raw brainwaves, and using FFT and Frequency Index calculate various bands of brainwaves.☆12Nov 7, 2016Updated 9 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- AdvanceControl☆11Dec 13, 2022Updated 3 years ago
- Neuroproc dataset descriptions and dictionaries☆16Jan 2, 2017Updated 9 years ago
- Python tool allowing easy book downloads from the terminal☆12Mar 15, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Using RL-controlled vehicles as traffic regulator to reduce the travel time of emergency vehicles near intersections☆11Jan 27, 2022Updated 4 years ago
- Round Levels indicator to display round level zones and lines in MetaTrader 4 and MetaTrader 5 platforms.☆12Jun 4, 2025Updated 9 months ago
- Expectimax AI for the game 2048☆16May 29, 2014Updated 11 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- A pytorch implementation of "Latent Variable Dialogue Models and their Diversity"☆18Nov 30, 2017Updated 8 years ago
- 🎧 Real-time data streaming from NeuroSky MindWave Mobile Headset☆10Jul 17, 2020Updated 5 years ago
- repository for my TLDR for deep learning papers (and SML papers!)☆16Jun 2, 2017Updated 8 years ago
- Opensource embedded controller firmware for sipeed boards.☆14Apr 15, 2019Updated 6 years ago
- 智能网联车辆和人工驾驶车辆混合行驶异质交通流特性研究☆16Sep 16, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Linux kernel for OrangePi A64☆11Mar 3, 2020Updated 6 years ago
- Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization"(https://arxiv.org/abs/1705.04304)☆18Aug 15, 2017Updated 8 years ago
- All in AI MODELS☆12Oct 14, 2023Updated 2 years ago
- Time-Contrastive Learning☆69May 30, 2018Updated 7 years ago
- A PyTorch Toolbox for Deep Reinforcement Learning☆10Jun 25, 2020Updated 5 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- PaddleOCR for Chinese pdf☆15Jan 12, 2022Updated 4 years ago