vincentkslim / cs285_homework_fall2020
CS285 Homework
☆26Updated 4 years ago
Alternatives and similar repositories for cs285_homework_fall2020
Users that are interested in cs285_homework_fall2020 are comparing it to the libraries listed below
Sorting:
- Pytorch solutions for UC Berkeley's cs285 assignments☆138Updated 3 years ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆121Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- Benchmarked implementations of Offline RL Algorithms.☆72Updated 2 months ago
- ☆47Updated 2 years ago
- ☆88Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆46Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆178Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- Re-implementations of SOTA RL algorithms.☆132Updated last year
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)☆124Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)☆146Updated 2 years ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆26Updated 3 years ago
- ☆267Updated 3 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆91Updated 8 months ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 2 years ago
- ☆129Updated 9 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆135Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆86Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆47Updated last year
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆21Updated 3 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆18Updated last year
- ☆111Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆64Updated 11 months ago
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago