MLP framework (pure NumPy) and DDQN framework for OpenAI Gym games. Also includes test code for PPO, a Hindsight Experience Replay (HER) bit-flip DQN example, and prioritized replay.
☆19 · May 24, 2018 · Updated 8 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below.
- Navigation agent with Bayesian relational memory in the House3D environment ☆30 · Sep 13, 2019 · Updated 6 years ago
- Code for an optimal velocity model (OVM) and a multiple car following (MCF) model ☆11 · Sep 14, 2018 · Updated 7 years ago
- [Re] Next Generation Reservoir Computing ☆11 · Dec 12, 2022 · Updated 3 years ago
- ☆12 · Dec 14, 2021 · Updated 4 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo… ☆12 · Nov 1, 2022 · Updated 3 years ago
- ☆11 · Dec 23, 2024 · Updated last year
- Online Resource Repository: Datasets, Simulation Platforms, and Empirical Research on Emerging Mixed Traffic of Automated Vehicles and Hu… ☆16 · Nov 29, 2023 · Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆46 · Oct 4, 2020 · Updated 5 years ago
- An implementation of a few of the results from Early Visual Concept Learning with Unsupervised Deep Learning ☆28 · Oct 2, 2016 · Updated 9 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro… ☆155 · Oct 25, 2021 · Updated 4 years ago
- STM32F103xB Firmware. CMSIS-DAP (USB HID) + 2 x High-speed UART (USB CDC) + UART/SLCAN + USB-I2C ☆12 · Feb 8, 2020 · Updated 6 years ago
- CATS Lab ACC data is the car-following trajectory dataset including both mix traffic and pure AV traffic. ☆11 · Jan 6, 2023 · Updated 3 years ago
- Numpy implementation of Gaussian Process Regression ☆11 · May 27, 2019 · Updated 7 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms. ☆24 · May 3, 2019 · Updated 7 years ago
- Master's Degree final thesis project: reduce emergency vehicles travel time using V2V communications in VEINS simulator ☆11 · Jan 9, 2022 · Updated 4 years ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games ☆12 · Feb 15, 2018 · Updated 8 years ago
- littlefs module for Zephyr, not a mirror of the official littlefs repository ☆19 · May 20, 2025 · Updated last year
- code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning" ☆11 · May 2, 2024 · Updated 2 years ago
- USDX indicator calculates and displays the US dollar index in the separate window of any other chart. ☆11 · Aug 8, 2025 · Updated 10 months ago
- Using very few experiments to efficiently learn an approximate model of an n-qubit quantum process. ☆16 · Apr 15, 2023 · Updated 3 years ago
- PPO with Hindsight Experience Replay (HER) ☆12 · May 8, 2018 · Updated 8 years ago
- A thorough, straightforward, un-intimidating introduction to Gaussian processes in NumPy. ☆16 · Jun 12, 2018 · Updated 8 years ago
- Forex Trend Finder App in React Native with Redux for Harvard CS50 Final Project ☆12 · Dec 9, 2022 · Updated 3 years ago
- ☆33 · Oct 17, 2018 · Updated 7 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views. ☆13 · May 21, 2022 · Updated 4 years ago
- Python tool allowing easy book downloads from the terminal ☆12 · Mar 15, 2023 · Updated 3 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix… ☆12 · Jun 23, 2018 · Updated 7 years ago
- Program to import raw brainwaves, and using FFT and Frequency Index calculate various bands of brainwaves. ☆12 · Nov 7, 2016 · Updated 9 years ago
- Using RL-controlled vehicles as traffic regulator to reduce the travel time of emergency vehicles near intersections ☆11 · Jan 27, 2022 · Updated 4 years ago
- Expectimax AI for the game 2048 ☆16 · May 29, 2014 · Updated 12 years ago
- Round Levels indicator to display round level zones and lines in MetaTrader 4, MetaTrader 5, and cTrader platforms. ☆14 · Apr 20, 2026 · Updated last month
- CS277 Project: Deep Reinforcement Learning in portfolio Management. This repo is the DQN part which implements a trading agent based on t… ☆14 · Jan 19, 2020 · Updated 6 years ago
- A simple command line chess game ☆15 · Jun 30, 2021 · Updated 4 years ago
- 🎧 Real-time data streaming from NeuroSky MindWave Mobile Headset ☆10 · Jul 17, 2020 · Updated 5 years ago
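- Study of heterogeneous traffic flow characteristics under mixed driving of intelligent connected vehicles and human-driven vehicles ☆17 · Sep 16, 2022 · Updated 3 years ago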
- repository for my TLDR for deep learning papers (and SML papers!) ☆17 · Jun 2, 2017 · Updated 9 years ago
- ♕ A web based and Deep-Reinforcement-Learning-powered open source chess game. ☆18 · Feb 22, 2026 · Updated 3 months ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020) ☆15 · Feb 4, 2020 · Updated 6 years ago
- 2D Optical flow using NVIDIA CUDA ☆18 · Jul 23, 2021 · Updated 4 years ago