☆30Mar 1, 2022Updated 4 years ago
Alternatives and similar repositories for mbpo_pytorch
Users that are interested in mbpo_pytorch are comparing it to the libraries listed below
Sorting:
- ☆15Sep 14, 2020Updated 5 years ago
- A beamer template for LAMDA lab at NJU☆16Oct 17, 2020Updated 5 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆185Apr 12, 2022Updated 3 years ago
- Re-implementations of SOTA RL algorithms.☆137Sep 7, 2023Updated 2 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆72Feb 7, 2023Updated 3 years ago
- NeurIPS Reproducibility Challenge 2019☆20Feb 25, 2020Updated 6 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Jun 2, 2022Updated 3 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- RLA is a tool for managing your RL experiments automatically☆32Jan 11, 2025Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆17Feb 14, 2024Updated 2 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Replicating Imagination-Augmented Agents for Deep Reinforcement Learning☆20Dec 17, 2017Updated 8 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 11 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- CS285课程笔记☆24Jan 19, 2020Updated 6 years ago
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆25Feb 28, 2022Updated 4 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆18Jul 9, 2024Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated last year
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 3 years ago
- Official implementation of NeurIPS'23 paper "Macro Placement by Wire-Mask-Guided Black-Box Optimization"☆30May 23, 2025Updated 9 months ago
- ☆26Jun 14, 2022Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Nov 14, 2024Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- EEN: Error Encoding Network☆66Dec 3, 2017Updated 8 years ago
- Benchmarked implementations of Offline RL Algorithms.☆77Mar 4, 2025Updated last year
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 4 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆77Jun 23, 2023Updated 2 years ago