dematsunaga / alberdice
Official PyTorch implementation of AlberDICE
☆22Updated last year
Alternatives and similar repositories for alberdice:
Users that are interested in alberdice are comparing it to the libraries listed below
- An open source benchmark for Multi Agent Reinforcement Learning☆29Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆65Updated last year
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆39Updated 6 months ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆69Updated 3 years ago
- ☆11Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆120Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆30Updated 4 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆55Updated 10 months ago
- ☆17Updated 2 months ago
- Representation Learning for RL☆123Updated 2 years ago
- Official code repository for Prompt-DT.☆107Updated 2 years ago
- ☆48Updated last year
- ☆46Updated 2 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 3 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆75Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆77Updated 3 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆34Updated last month
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆16Updated last month
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆64Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆110Updated last year
- Simple maze environments using mujoco-py☆54Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 9 months ago
- ☆78Updated 3 weeks ago
- ☆39Updated last year
- A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)☆54Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆135Updated 10 months ago
- ☆108Updated last year
- A PyTorch implementation of Implicit Q-Learning☆75Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆138Updated last year