MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.
☆19May 24, 2018Updated 8 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving coordinated (two intersections) transit signal priority on bus travel time and headway reliability with single agent reinforcem…☆14Oct 2, 2021Updated 4 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24May 3, 2019Updated 7 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- python, ccxt, backtrader, dash☆10Apr 20, 2018Updated 8 years ago
- Code for an optimal velocity model (OVM) and a multiple car following (MCF) model☆11Sep 14, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [Re] Next Generation Reservoir Computing☆11Dec 12, 2022Updated 3 years ago
- ☆12Jun 28, 2019Updated 6 years ago
- ☆12Dec 14, 2021Updated 4 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo…☆12Nov 1, 2022Updated 3 years ago
- ☆12Jul 13, 2023Updated 2 years ago
- ☆11Dec 23, 2024Updated last year
- Online Resource Repository: Datasets, Simulation Platforms, and Empirical Research on Emerging Mixed Traffic of Automated Vehicles and Hu…☆16Nov 29, 2023Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Oct 4, 2020Updated 5 years ago
- Here is an implementation of some of a few results seen in Early Visual Concept Learning with Unsupervised Deep Learning☆28Oct 2, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆155Oct 25, 2021Updated 4 years ago
- STM32F103xB Firmware. CMSIS-DAP (USB HID) + 2 x High-speed UART (USB CDC) + UART/SLCAN + USB-I2C☆12Feb 8, 2020Updated 6 years ago
- Numpy implementation of Gaussian Process Regression☆11May 27, 2019Updated 7 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 7 years ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games☆12Feb 15, 2018Updated 8 years ago
- littlefs module for Zephyr, not a mirror of the official littlefs repository☆19May 20, 2025Updated last year
- code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"☆11May 2, 2024Updated 2 years ago
- Using very few experiments to efficiently learn an approximate model of an n-qubit quantum process.☆16Apr 15, 2023Updated 3 years ago
- PPO with Hindsight Experience Replay (HER)☆12May 8, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…☆13Oct 7, 2022Updated 3 years ago
- Powershell cmdlet for rendering image files to console☆34Mar 19, 2023Updated 3 years ago
- ☆33Oct 17, 2018Updated 7 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:☆13May 21, 2022Updated 4 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- Using RL-controlled vehicles as traffic regulator to reduce the travel time of emergency vehicles near intersections☆11Jan 27, 2022Updated 4 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- A simple command line chess game☆15Jun 30, 2021Updated 4 years ago
- 🎧 Real-time data streaming from NeuroSky MindWave Mobile Headset☆10Jul 17, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Opensource embedded controller firmware for sipeed boards.☆14Apr 15, 2019Updated 7 years ago
- repository for my TLDR for deep learning papers (and SML papers!)☆17Jun 2, 2017Updated 8 years ago
- The Linux kernel for OrangePi A64☆12Mar 3, 2020Updated 6 years ago
- ♕ A web based and Deep-Reinforcement-Learning-powered open source chess game.☆18Feb 22, 2026Updated 3 months ago
- PaddleOCR for Chinese pdf☆15Jan 12, 2022Updated 4 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- ☆17Feb 25, 2024Updated 2 years ago