thaihungle / MBEC-plusLinks
Model-based Episodic Control & Complementary Learning Systems
☆16Updated 4 years ago
Alternatives and similar repositories for MBEC-plus
Users that are interested in MBEC-plus are comparing it to the libraries listed below
Sorting:
- ☆13Updated last year
- Episodic Policy Gradient Training☆16Updated 3 years ago
- Memory-augmented Encoder Decoder Architecture☆13Updated 5 years ago
- Source code for paper "Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory"☆12Updated last year
- Demo code for AJCAI22-Tutorial☆11Updated 3 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated 9 months ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Updated last year
- Source code for Stable Hadamard Memory☆22Updated 8 months ago
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"☆33Updated 2 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆67Updated 2 years ago
- Self-attentive Associative Memory & SAM-based Two-Memory Model☆59Updated 3 years ago
- Neural Logic Inductive Learning☆44Updated 3 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆23Updated 2 years ago
- Reward Propagation using Graph Convolutional Networks☆13Updated 4 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Updated 2 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- ☆57Updated last year
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆73Updated 2 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Updated 4 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- ☆35Updated 3 years ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆17Updated 4 months ago
- Implements the Messenger environment and EMMA model.☆25Updated 2 years ago
- Tensorflow implementation of SNAIL and RL2☆11Updated 6 years ago
- Representation Learning in RL☆13Updated 3 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Updated 7 months ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆18Updated 5 years ago
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Updated last year
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Updated 4 years ago