microsoft / IntrepidLinks
INTeractive learning via REPresentatIon Discovery
☆34Updated last year
Alternatives and similar repositories for Intrepid
Users that are interested in Intrepid are comparing it to the libraries listed below
Sorting:
- Repo to reproduce the First-Explore paper results☆38Updated 10 months ago
- Generalised UDRL☆37Updated 3 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- Sandbox environment for generalizable agent research☆25Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆75Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆116Updated last year
- ☆28Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆45Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last month
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 3 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆137Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆33Updated 4 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆85Updated 3 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆84Updated last year
- ☆23Updated last year
- PyTorch Package For Quasimetric Learning☆43Updated 11 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Reward Learning by Simulating the Past☆46Updated 6 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- ☆56Updated 11 months ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆81Updated 3 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆69Updated 2 years ago
- A tool for recording RL trajectories.☆108Updated 3 months ago
- ☆32Updated last year
- MultiTask Environments for Reinforcement Learning.☆78Updated 3 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated last year
- ☆46Updated last year
- ☆19Updated 2 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Updated last year