thaihungle / MBEC-plusLinks
Model-based Episodic Control & Complementary Learning Systems
☆17Updated 4 years ago
Alternatives and similar repositories for MBEC-plus
Users that are interested in MBEC-plus are comparing it to the libraries listed below
Sorting:
- ☆14Updated last year
- Episodic Policy Gradient Training☆17Updated 3 years ago
- Memory-augmented Encoder Decoder Architecture☆14Updated 5 years ago
- Demo code for AJCAI22-Tutorial☆11Updated 3 years ago
- Source code for paper "Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory"☆13Updated last year
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated 9 months ago
- Source code for Stable Hadamard Memory☆23Updated 9 months ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Updated last year
- ☆38Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- ☆57Updated last year
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆47Updated 4 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Updated 3 years ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆35Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- Neural Logic Inductive Learning☆44Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Updated 2 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆67Updated 2 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Updated last year
- ☆41Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆17Updated 3 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆47Updated 3 years ago
- Self-attentive Associative Memory & SAM-based Two-Memory Model☆60Updated 3 years ago
- Reward Propagation using Graph Convolutional Networks☆13Updated 4 years ago
- Avalanche fork adding RL support☆78Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- An unofficial implementation for online decision transformer☆41Updated 3 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Updated 2 years ago
- Official implementation of the Informed Dreamer algorithm, based on DreamerV3☆19Updated last week