Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
Alternatives and similar repositories for IDAC
Users that are interested in IDAC are comparing it to the libraries listed below
Sorting:
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation☆15Jun 22, 2020Updated 5 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration☆18Feb 9, 2021Updated 5 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Randomized Value Functions via Multiplicative Normalizing Flows☆17Jan 1, 2023Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Jan 21, 2021Updated 5 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆23Oct 9, 2020Updated 5 years ago
- Distributional Soft Actor Critic☆60Jun 6, 2020Updated 5 years ago
- ☆23Jun 8, 2021Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆27Jul 14, 2021Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- Repository for studying distributional rl☆30Feb 2, 2025Updated last year
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- WMG agent☆34Oct 3, 2023Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- # Best Track Data (HURDAT2) Atlantic hurricane database (HURDAT2) 1851-2018 (5.9MB download) This dataset was provided on 10 May 2019 to…☆10Nov 5, 2019Updated 6 years ago
- ☆11May 13, 2021Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆13Aug 15, 2023Updated 2 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆44Oct 4, 2020Updated 5 years ago
- Neuronal Circuit Policies☆41Jul 21, 2022Updated 3 years ago
- ☆21Feb 12, 2026Updated 2 weeks ago
- 🤖Artificial intelligence classify a food 🍎 nutritional table by a simple photo. Don't eat 🍔🍕🌮...☆10May 7, 2020Updated 5 years ago
- ☆12Oct 11, 2022Updated 3 years ago
- neuralpy - neural network library written in python☆12Jun 25, 2023Updated 2 years ago
- The ROS interface as well as the Python packages for ProSeCo Planning☆10Jun 17, 2024Updated last year
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo.☆10Feb 11, 2022Updated 4 years ago
- Tensorflow implementation of "MC2-Net: Motion Correction Network for Multi-Contrast Brain MRI"☆12Dec 8, 2022Updated 3 years ago
- Course materials for a 3-day seminar "Machine Learning and NLP: Advances and Applications" at New College of Florida☆12Feb 10, 2022Updated 4 years ago
- Nonequispaced FFTs on GPUs (based on NFFT: http://www.nfft.org)☆11Apr 30, 2018Updated 7 years ago
- ☆28Oct 27, 2025Updated 4 months ago
- Learning to draw samples: with application to amortized maximum likelihood estimator for generative adversarial learning☆10Dec 28, 2021Updated 4 years ago
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆18Jan 26, 2025Updated last year
- ☆10Mar 10, 2021Updated 4 years ago
- Cellular automata traffic simulation☆11Jan 18, 2021Updated 5 years ago