thaihungle / MBEC-plus
Model-based Episodic Control & Complementary Learning Systems
☆14 Updated 3 years ago
Alternatives and similar repositories for MBEC-plus
Users who are interested in MBEC-plus are comparing it to the libraries listed below.
- ☆12 Updated 9 months ago
- Episodic Policy Gradient Training ☆15 Updated 3 years ago
- Source code for paper "Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory" ☆11 Updated 11 months ago
- Memory-augmented Encoder Decoder Architecture ☆12 Updated 5 years ago
- Code for Sibling Rivalry and experiments presented in associated paper ☆18 Updated 4 months ago
- Source code for Stable Hadamard Memory ☆20 Updated 4 months ago
- Self-attentive Associative Memory & SAM-based Two-Memory Model ☆57 Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 Updated 3 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022 ☆63 Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT. ☆30 Updated last year
- Implements the Messenger environment and EMMA model. ☆25 Updated 2 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning ☆90 Updated 2 years ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆70 Updated 2 years ago
- ☆48 Updated 2 years ago
- Reward Propagation using Graph Convolutional Networks ☆13 Updated 4 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation" ☆18 Updated 11 months ago
- ☆22 Updated 5 months ago
- On-Policy Policy Gradient Algorithms in JAX ☆39 Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆30 Updated last year
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning". ☆59 Updated last year
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023… ☆29 Updated 2 years ago
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments. ☆59 Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆30 Updated last year
- ☆35 Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆32 Updated 3 months ago
- ☆11 Updated 2 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆76 Updated last year
- WMG agent ☆34 Updated last year
- Code for our TMLR paper "Distributional GFlowNets with Quantile Flows". ☆12 Updated last year