thaihungle / MBEC-plusLinks

Model-based Episodic Control & Complementary Learning Systems

☆14

Alternatives and similar repositories for MBEC-plus

Users that are interested in MBEC-plus are comparing it to the libraries listed below

Sorting:

thaihungle / MRPO
☆12Updated 7 months ago
thaihungle / EPGT
Episodic Policy Gradient Training
☆14Updated 3 years ago
thaihungle / MAED
Memory-augmented Encoder Decoder Architecture
☆12Updated 5 years ago
thaihungle / PANM
Source code for paper "Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory"
☆11Updated 9 months ago
salesforce / sibling-rivalry
Code for Sibling Rivalry and experiments presented in associated paper
☆18Updated 2 months ago
luckeciano / transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
☆63Updated 2 years ago
thuml / SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22Updated 2 years ago
ahjwang / messenger-emma
Implements the Messenger environment and EMMA model.
☆23Updated 2 years ago
lanyavik / BAIL
☆17Updated 3 years ago
thaihungle / SHM
Source code for Stable Hadamard Memory
☆17Updated 2 months ago
Stanford-ILIAD / Diverse-Conventions
Exploring techniques to generate diverse conventions in multi-agent settings
☆15Updated last year
gblackout / NLIL
Neural Logic Inductive Learning
☆44Updated 2 years ago
ahmed-touati / controllable_agent
☆46Updated 2 years ago
mansicer / Q-Adapter
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆17Updated 9 months ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆17Updated 3 years ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30Updated 5 years ago
eric-mitchell / macaw-min
Clean, extensible implementation of MACAW [ICML 2021]
☆12Updated 3 years ago
sumedh7 / CausalCuriosity
Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…
☆39Updated 4 years ago
twni2016 / Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆63Updated last year
microsoft / wmg_agent
WMG agent
☆35Updated last year
luchris429 / model-free-opponent-shaping
Code for Model-Free Opponent Shaping (ICML 2022)
☆19Updated 2 years ago
jesbu1 / hidio
Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆45Updated 3 years ago
tung-nd / cwbc
☆11Updated 2 years ago
daochenzha / rapid
[ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.
☆59Updated 2 years ago
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆67Updated 2 years ago
machelreid / can-wikipedia-help-offline-rl
Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu
☆105Updated 3 years ago
Rondorf / BOReL
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…
☆31Updated 3 years ago
mklissa / phi_gcn
Reward Propagation using Graph Convolutional Networks
☆13Updated 4 years ago
CR-Gjx / RIA
TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…
☆16Updated 3 years ago
Matt00n / PolicyGradientsJax
On-Policy Policy Gradient Algorithms in JAX
☆38Updated last year