si0wang / COPlannerLinks
☆23Updated last year
Alternatives and similar repositories for COPlanner
Users that are interested in COPlanner are comparing it to the libraries listed below
Sorting:
- Official repo for Offline RL for Online RL☆18Updated 2 years ago
- ☆25Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 3 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆34Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆78Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)☆98Updated last month
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆68Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Updated 2 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆72Updated 4 months ago
- ☆28Updated last year
- ☆45Updated last month
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆91Updated 10 months ago
- Action Value Gradient Algorithm☆24Updated 5 months ago
- ☆35Updated 2 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆109Updated 2 years ago
- ☆34Updated 4 months ago
- PWM: Policy Learning with Large World Models☆58Updated 2 months ago
- Meta-RL Model-Based Algorithm☆40Updated 6 months ago
- ☆25Updated 10 months ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆34Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Updated last year
- Evaluation of TD-MPC2.☆21Updated last year
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 11 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆78Updated 2 years ago
- ☆24Updated last year
- 🔥Benchmarking of Neural Network Architectures in Reinforcement Learning.☆28Updated last month
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆80Updated last year