ChristosKap / policy_consolidationView external linksLinks
Code for Policy Consolidation for Continual Reinforcement Learning
☆10May 12, 2019Updated 6 years ago
Alternatives and similar repositories for policy_consolidation
Users that are interested in policy_consolidation are comparing it to the libraries listed below
Sorting:
- ☆10Oct 11, 2022Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"☆19Jul 11, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆20Jun 14, 2022Updated 3 years ago
- ☆20Apr 10, 2018Updated 7 years ago
- Code for Continual Reinforcement Learning with Multi-Timescale Replay☆24Apr 16, 2020Updated 5 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆106Jun 18, 2022Updated 3 years ago
- ☆26Mar 16, 2023Updated 2 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- ratsnlp, KOGPT2와 recipegpt github를 참고하여 음식명과 식재료명을 입력하면 레시피 를 생성해주는 모델을 제작하였습니다!!☆11Dec 28, 2021Updated 4 years ago
- Reweighted Expectation Maximization☆29Jun 14, 2019Updated 6 years ago
- Code to minimize the Variational Contrastive Divergence (VCD)☆29May 30, 2019Updated 6 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆123Dec 28, 2020Updated 5 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- ☆33Aug 30, 2024Updated last year
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Sep 1, 2022Updated 3 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago
- ☆10Apr 20, 2016Updated 9 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 5 years ago
- Rendering code for ShapeNet models☆11Apr 20, 2017Updated 8 years ago
- Learning Backtracking Models, ICLR'19☆10Feb 2, 2018Updated 8 years ago
- This repo contains PPO implementation in PyTorch for LunarLander-v2☆11Jun 26, 2020Updated 5 years ago
- Bayesian Regression Models using pymc3☆11Feb 4, 2017Updated 9 years ago
- On Robustness of Neural Ordinary Differential Equations☆11Oct 12, 2021Updated 4 years ago
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.☆11Feb 6, 2023Updated 3 years ago
- pytorch faster r-cnn☆11Dec 21, 2020Updated 5 years ago
- 팡요랩 자료☆11May 31, 2019Updated 6 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago