microsoft / rl-offline-simulation
Data-driven offline simulation for online reinforcement learning: benchmark and baselines
☆29Updated 9 months ago
Alternatives and similar repositories for rl-offline-simulation:
Users that are interested in rl-offline-simulation are comparing it to the libraries listed below
- Imitation learning from multiple experts☆12Updated 2 years ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆51Updated last year
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆25Updated 2 years ago
- ☆13Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 5 years ago
- ☆32Updated 8 months ago
- Explore and Control with Adversarial Surprise☆10Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 6 months ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Updated 8 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- Code for demonstration example-task in RUDDER blog☆23Updated 4 years ago
- ☆24Updated 8 months ago
- INTeractive learning via REPresentatIon Discovery☆34Updated 10 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- ☆10Updated 2 years ago
- The official implementation of Memory-efficient DQN algorithm.☆10Updated last year
- ☆31Updated 2 years ago
- ☆16Updated 3 years ago
- Explainable Reinforcement Learning (XRL) Resources☆37Updated 7 months ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- ☆31Updated 2 years ago
- Benchmark data for d3rlpy☆20Updated last year
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆14Updated 3 years ago
- ☆36Updated 3 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- ☆16Updated 2 years ago