tung-nd / cwbc
☆11Updated 2 years ago
Alternatives and similar repositories for cwbc:
Users that are interested in cwbc are comparing it to the libraries listed below
- ☆14Updated 3 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 11 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆13Updated 3 years ago
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆31Updated 3 years ago
- ☆48Updated last year
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆66Updated 3 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆32Updated 3 months ago
- Code for FOCAL Paper Published at ICLR 2021☆52Updated last year
- code for the paper Offline Prioritized Experience Replay☆13Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆17Updated 3 years ago
- ☆16Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 3 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆19Updated 2 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆33Updated last year
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆13Updated 2 years ago
- ☆26Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆53Updated 3 years ago
- ☆44Updated last year
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆26Updated 3 years ago
- Conservative Q learning in Jax☆53Updated 2 years ago
- ☆30Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆30Updated 4 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆21Updated last year
- Code for the paper "Learning Options via Compression" at NeurIPS 2022☆23Updated 2 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆17Updated 2 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 2 years ago