abhishm / PGQView external linksLinks
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
☆15Mar 9, 2017Updated 8 years ago
Alternatives and similar repositories for PGQ
Users that are interested in PGQ are comparing it to the libraries listed below
Sorting:
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 7 years ago
- ☆32Oct 17, 2018Updated 7 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Oct 26, 2020Updated 5 years ago
- Models built with TensorFlow☆26Dec 5, 2018Updated 7 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- Reinforcement Learning framework to facilitate development and use of scalable RL algorithms and applications☆61Mar 29, 2018Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- ☆10Apr 21, 2017Updated 8 years ago
- Robotics Learning Note☆11Jun 22, 2018Updated 7 years ago
- Model for Udacity's challenge which uses end-to-end learning to predict steering angles from just front camera image as input for self dr…☆10Apr 21, 2017Updated 8 years ago
- Model-Based Generative Adversarial Imitation Learning☆89Mar 29, 2021Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Feb 5, 2019Updated 7 years ago
- Replication of Uber Neuroevolution paper☆46Apr 14, 2018Updated 7 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆12Jun 19, 2017Updated 8 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 4 years ago
- ☆12Dec 2, 2020Updated 5 years ago
- ardrone simulation in gazebo(for kinetic and gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- AWS FastAPI deployment on top of ALB and ECS with Docker containers implementing ECS as the orchestration tool for an AWS-managed infrast…☆10May 22, 2023Updated 2 years ago
- A proxy for reverse engineering a communication protocol☆10Jan 17, 2021Updated 5 years ago
- ☆10May 13, 2025Updated 9 months ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Approximate Multiparametric Mixed-integer Convex Programming☆15May 16, 2019Updated 6 years ago
- Nauka is a collection of utilities for scientific experiments.☆15Jul 27, 2022Updated 3 years ago
- ☆11Sep 15, 2016Updated 9 years ago
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- PID controller playground in Python☆11Dec 9, 2016Updated 9 years ago
- Stochastic Variance Reduction Policy Gradient Estimation☆11Nov 6, 2018Updated 7 years ago
- Unreal Engine simulator for our self driving car training☆11Nov 18, 2021Updated 4 years ago
- Learning Backtracking Models, ICLR'19☆10Feb 2, 2018Updated 8 years ago
- FNV hash collision generator☆12Mar 2, 2017Updated 8 years ago
- TensorFlow code for paper "Training Frankenstein's Creature to Stack: HyperTree Architecture Search"☆13Nov 14, 2018Updated 7 years ago