Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆38Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for LAP-PAL
Users that are interested in LAP-PAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Actor Prioritized Experience Replay☆18Nov 20, 2023Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- OpenAI Gym Environment for ROS.☆13Nov 1, 2017Updated 8 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- A python package for loading robotics datasets which were recorded on the TriFinger platform. Also contains simulated gym environments th…☆17Jan 17, 2024Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆400Dec 18, 2021Updated 4 years ago
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated last month
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆25May 20, 2024Updated last year
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆660Apr 6, 2021Updated 4 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- ☆18Jul 25, 2024Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Aug 8, 2022Updated 3 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- ☆19Apr 22, 2024Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- ☆14Jan 15, 2023Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- Apply safe RL methods from safety-starter-agents in highway-env☆14Jun 28, 2021Updated 4 years ago
- ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)☆50Feb 15, 2022Updated 4 years ago
- Library for model based RL in robotics☆37Sep 10, 2018Updated 7 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆64Apr 4, 2023Updated 2 years ago
- DrQ: Data regularized Q☆420Jan 13, 2023Updated 3 years ago
- ☆18Jul 13, 2022Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112May 27, 2024Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆295Feb 24, 2021Updated 5 years ago
- Original code for the paper "Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping" by Mezghani et al.☆18Jun 8, 2023Updated 2 years ago
- code for paper "Two-Critic Deep Reinforcement Learning for Inverter-based Volt-Var Control in Active Distribution Networks"☆18Apr 10, 2024Updated last year
- An implementation of a full motion and behavior planning pipeline for a self-driving car in the CARLA simulator.☆15Mar 7, 2021Updated 5 years ago
- Code for the paper: "MIDAS: Multi-agent Interaction-aware Decision-making with Adaptive Strategies for Urban Autonomous Navigation"☆17Sep 21, 2021Updated 4 years ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆29Jul 6, 2023Updated 2 years ago
- ☆17Jul 11, 2020Updated 5 years ago