Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47 · Jan 21, 2021 · Updated 5 years ago
Alternatives and similar repositories for GAN-Q-Learning
Users who are interested in GAN-Q-Learning are comparing it to the libraries listed below.
- Implicit Distributional Actor Critic ☆11 · Dec 8, 2021 · Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys… ☆21 · Feb 24, 2023 · Updated 3 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers ☆24 · Feb 15, 2023 · Updated 3 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models ☆10 · Mar 22, 2018 · Updated 8 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆30 · Mar 14, 2019 · Updated 7 years ago
- Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators" ☆11 · Jul 5, 2023 · Updated 2 years ago
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling ☆13 · Mar 27, 2024 · Updated 2 years ago
- Multi-Agent Determinantal Q-Learning ☆43 · Nov 22, 2022 · Updated 3 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale… ☆12 · Aug 24, 2018 · Updated 7 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 · Nov 16, 2020 · Updated 5 years ago
- Solving reinforcement learning tasks which require language and vision ☆33 · Apr 4, 2023 · Updated 3 years ago
- ☆14 · Jun 26, 2019 · Updated 6 years ago
- Code for the following publication: F. B. Mismar, J. Choi, and B. L. Evans, "A Framework for Automated Cellular Network Tuning with Rein… ☆50 · Jan 24, 2022 · Updated 4 years ago
- SeqGAN but with more bells and whistles ☆24 · Feb 15, 2018 · Updated 8 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- [NeurIPS 2020, Spotlight] Improved Schemes for Episodic Memory-based Lifelong Learning ☆18 · Dec 12, 2020 · Updated 5 years ago
- Dynamic Power Management using Reinforcement Learning for IoT devices. ☆11 · Oct 23, 2021 · Updated 4 years ago
- ICML'20: Intrinsic Reward Driven Imitation Learning via Generative Model ☆15 · Nov 5, 2021 · Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆150 · Apr 13, 2023 · Updated 3 years ago
- Reinforcement Learning with Perturbed Reward, AAAI 2020 ☆30 · Aug 2, 2024 · Updated last year
- learning robust rewards with adversarial inverse reinforcement learning ☆14 · Sep 13, 2020 · Updated 5 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO) ☆23 · Dec 8, 2022 · Updated 3 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch. ☆19 · Jun 15, 2018 · Updated 7 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018). ☆16 · May 29, 2018 · Updated 8 years ago
- RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs (DAC'23) ☆11 · Apr 13, 2023 · Updated 3 years ago
- State Space Models for Reinforcement Learning in Tensorflow ☆19 · Jan 27, 2019 · Updated 7 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf… ☆11 · Jul 14, 2021 · Updated 4 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning" ☆31 · Nov 22, 2022 · Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆23 · Dec 22, 2020 · Updated 5 years ago
- meta-MADDPG (Python implementation) ☆19 · Sep 16, 2018 · Updated 7 years ago
- ☆66 · May 25, 2020 · Updated 6 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP) ☆26 · Oct 11, 2022 · Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Jun 24, 2020 · Updated 5 years ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019. ☆220 · Jun 19, 2019 · Updated 6 years ago
- Simple grid-world environment compatible with OpenAI-gym ☆50 · Mar 19, 2020 · Updated 6 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆40 · Jan 22, 2021 · Updated 5 years ago
- krazy grid world ☆25 · Mar 2, 2020 · Updated 6 years ago