jiangsy / mbpo_pytorchView external linksLinks
☆30Mar 1, 2022Updated 3 years ago
Alternatives and similar repositories for mbpo_pytorch
Users that are interested in mbpo_pytorch are comparing it to the libraries listed below
Sorting:
- ☆15Sep 14, 2020Updated 5 years ago
- A beamer template for LAMDA lab at NJU☆16Oct 17, 2020Updated 5 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆184Apr 12, 2022Updated 3 years ago
- Re-implementations of SOTA RL algorithms.☆136Sep 7, 2023Updated 2 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- RLA is a tool for managing your RL experiments automatically☆72Feb 7, 2023Updated 3 years ago
- NeurIPS Reproducibility Challenge 2019☆20Feb 25, 2020Updated 5 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Jun 2, 2022Updated 3 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- RLA is a tool for managing your RL experiments automatically☆31Jan 11, 2025Updated last year
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆530Nov 22, 2022Updated 3 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆17Feb 14, 2024Updated 2 years ago
- Replicating Imagination-Augmented Agents for Deep Reinforcement Learning☆20Dec 17, 2017Updated 8 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- CS285课程笔记☆24Jan 19, 2020Updated 6 years ago
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆25Feb 28, 2022Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Apr 17, 2024Updated last year
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 2 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- Official implementation of NeurIPS'23 paper "Macro Placement by Wire-Mask-Guided Black-Box Optimization"☆30May 23, 2025Updated 8 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Nov 14, 2024Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 3 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆76Mar 4, 2025Updated 11 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 4 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆76Jun 23, 2023Updated 2 years ago
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou☆11Jul 20, 2021Updated 4 years ago
- This is the public repo for the course HMMA238 'Software Development'☆10Apr 20, 2021Updated 4 years ago