aronsar / hoadLinks
☆14Updated 3 years ago
Alternatives and similar repositories for hoad
Users that are interested in hoad are comparing it to the libraries listed below
Sorting:
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12Updated 4 years ago
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- ☆52Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆120Updated 3 years ago
- ☆54Updated last year
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆81Updated 3 years ago
- Soft Actor-Critic☆156Updated 7 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆132Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 4 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Updated 4 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆103Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆69Updated 3 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- Image-based gridworld experiment for learning Markov state abstractions☆21Updated last year
- Simple maze environments using mujoco-py☆57Updated 2 years ago
- ☆114Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆24Updated last year
- Official code repository for Prompt-DT.☆119Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆160Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112Updated last year
- Conservative Q learning in Jax☆56Updated 2 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆108Updated 3 years ago
- DMControl Generalization Benchmark☆186Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- Change-Based Exploration Transfer☆35Updated 3 years ago
- ☆48Updated 2 years ago
- ☆52Updated 2 years ago