Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47Jan 21, 2021Updated 5 years ago
Alternatives and similar repositories for GAN-Q-Learning
Users that are interested in GAN-Q-Learning are comparing it to the libraries listed below
Sorting:
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- [NeurIPS 2020, Spotlight] Improved Schemes for Episodic Memory-based Lifelong Learning☆18Dec 12, 2020Updated 5 years ago
- Code release for paper "How good is my GAN?"☆12Mar 9, 2019Updated 6 years ago
- A style-transfer project using CycleGAN to render photos in the style of Studio Ghibli animations.☆11Feb 20, 2019Updated 7 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Heart beat interval sequence analysis.☆13Nov 11, 2017Updated 8 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Made for a reading group at the Center for Safe AGI.☆12Updated this week
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- ☆13Sep 11, 2018Updated 7 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 7 years ago
- ☆13Mar 31, 2024Updated last year
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 6 years ago
- Downloading and displaying academic ancestry☆16Oct 1, 2015Updated 10 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 9 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 2 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆11Jul 14, 2021Updated 4 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).☆16May 29, 2018Updated 7 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆17Sep 5, 2025Updated 5 months ago
- ☆16Nov 27, 2016Updated 9 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- meta-MADDPG (Python implementation)☆19Sep 16, 2018Updated 7 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 4 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago