Code for Policy Consolidation for Continual Reinforcement Learning
☆10May 12, 2019Updated 6 years ago
Alternatives and similar repositories for policy_consolidation
Users that are interested in policy_consolidation are comparing it to the libraries listed below
Sorting:
- ☆10Oct 11, 2022Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"☆19Jul 11, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆20Jun 14, 2022Updated 3 years ago
- Code for Continual Reinforcement Learning with Multi-Timescale Replay☆24Apr 16, 2020Updated 5 years ago
- ☆20Apr 10, 2018Updated 7 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆108Jun 18, 2022Updated 3 years ago
- ☆26Mar 16, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Reweighted Expectation Maximization☆29Jun 14, 2019Updated 6 years ago
- ratsnlp, KOGPT2와 recipegpt github를 참고하여 음식명과 식재료명을 입력하면 레시피를 생성해주는 모델을 제작하였습니다!!☆11Dec 28, 2021Updated 4 years ago
- Code to minimize the Variational Contrastive Divergence (VCD)☆29May 30, 2019Updated 6 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆123Dec 28, 2020Updated 5 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- ☆33Aug 30, 2024Updated last year
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Sep 1, 2022Updated 3 years ago
- Learning Backtracking Models, ICLR'19☆10Feb 2, 2018Updated 8 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- pytorch faster r-cnn☆11Dec 21, 2020Updated 5 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆10Apr 20, 2016Updated 9 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆23Oct 14, 2025Updated 4 months ago
- ☆12Jun 16, 2023Updated 2 years ago
- Rendering code for ShapeNet models☆11Apr 20, 2017Updated 8 years ago
- On Robustness of Neural Ordinary Differential Equations☆11Oct 12, 2021Updated 4 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- R packages to extract data from the https://kc.humanitarianresponse.info/ using API, recode and aggregate survey data. The library is bui…☆13Aug 21, 2019Updated 6 years ago
- ☆14Jul 18, 2025Updated 7 months ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago