Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers
☆24Feb 15, 2023Updated 3 years ago
Alternatives and similar repositories for ac-teach
Users that are interested in ac-teach are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…☆16Jul 2, 2022Updated 3 years ago
- Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"☆17Mar 24, 2023Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- ☆18Jul 15, 2019Updated 6 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- ☆25Oct 22, 2015Updated 10 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆12Feb 9, 2023Updated 3 years ago
- Official code for "Task-Embedded Control Networks for Few-Shot Imitation Learning".☆46Nov 29, 2019Updated 6 years ago
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Jan 21, 2021Updated 5 years ago
- Calibrating LLM Confidence by Probing Perturbed Representation Stability☆18Jul 5, 2025Updated 10 months ago
- PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).☆32Oct 27, 2021Updated 4 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.☆17Feb 17, 2021Updated 5 years ago
- support code for "Leveraging Contact Forces for Learning to Grasp" , ICRA 2019☆23May 29, 2019Updated 6 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 3 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Feb 3, 2022Updated 4 years ago
- ☆11Jun 17, 2016Updated 9 years ago
- ☆14Nov 23, 2023Updated 2 years ago
- [ICLR 2025] Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficien. The frist Mamba/Mamba2 MBRL agent.☆30Feb 5, 2025Updated last year
- Reproducing results of Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World https://arxiv.org/abs…☆27Dec 27, 2022Updated 3 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Feb 16, 2020Updated 6 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆34Feb 16, 2020Updated 6 years ago