skezle / owl
☆15Updated 2 years ago
Alternatives and similar repositories for owl:
Users that are interested in owl are comparing it to the libraries listed below
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆32Updated last year
- ☆20Updated 2 years ago
- ☆16Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated 2 years ago
- ☆47Updated last year
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆26Updated last year
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆24Updated last year
- Simple maze environments using mujoco-py☆54Updated last year
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆109Updated last year
- ☆29Updated 2 years ago
- ☆14Updated 3 years ago
- ☆53Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Updated 11 months ago
- ☆54Updated 10 months ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 2 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 2 years ago
- ☆86Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆38Updated 2 months ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10Updated 5 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 5 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆28Updated last year
- ☆18Updated 2 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆51Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- ☆36Updated 3 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago