microsoft / Intrepid
INTeractive learning via REPresentatIon Discovery
☆33Updated 3 months ago
Related projects: ⓘ
- GPT implementation in Flax☆18Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆13Updated last year
- ☆28Updated 2 years ago
- Sandbox environment for generalizable agent research☆22Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆21Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆51Updated 5 months ago
- ☆41Updated 5 months ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆81Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆77Updated 2 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆83Updated 7 months ago
- My Body Is A Cage☆37Updated 3 years ago
- Repo to reproduce the First-Explore paper results☆36Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Corax: Core RL in JAX☆30Updated 7 months ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 3 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆26Updated last year
- PyTorch Package For Quasimetric Learning☆38Updated last year
- Reinforcement Learning with Latent Flow☆42Updated 3 years ago
- ☆56Updated last month
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated last year
- Scalable Opponent Shaping Experiments in JAX☆19Updated 5 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆42Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆22Updated 2 months ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆36Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆32Updated 2 years ago