microsoft / rl-offline-simulation
Data-driven offline simulation for online reinforcement learning: benchmark and baselines
☆27Updated 6 months ago
Alternatives and similar repositories for rl-offline-simulation:
Users that are interested in rl-offline-simulation are comparing it to the libraries listed below
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆49Updated last year
- Imitation learning from multiple experts☆12Updated 2 years ago
- INTeractive learning via REPresentatIon Discovery☆33Updated 7 months ago
- ☆36Updated 3 years ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆27Updated last year
- ☆13Updated last year
- ☆9Updated 2 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- A lightweight reimplementation of Adversarially Trained Actor Critic☆18Updated last year
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆19Updated 6 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆38Updated 3 months ago
- ☆32Updated 6 months ago
- ☆75Updated 2 years ago
- An unofficial implementation for online decision transformer☆39Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- ☆16Updated 3 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆13Updated 3 years ago
- ☆31Updated 10 months ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Updated last year
- Code for demonstration example-task in RUDDER blog☆22Updated 4 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- ☆85Updated 6 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆27Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- ☆50Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- Reinforcement Learning via Supervised Learning☆69Updated 2 years ago