microsoft / Intrepid
INTeractive learning via REPresentatIon Discovery
☆33Updated 10 months ago
Alternatives and similar repositories for Intrepid:
Users that are interested in Intrepid are comparing it to the libraries listed below
- GPT implementation in Flax☆18Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 10 months ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆82Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆111Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆80Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- PyTorch Package For Quasimetric Learning☆41Updated 5 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆71Updated 7 months ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Baselines for gymnax 🤖☆66Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆53Updated last year
- ☆31Updated last year
- ☆43Updated 6 months ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago
- ☆41Updated 8 months ago
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957☆64Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- My Body Is A Cage☆39Updated 3 years ago
- General Modules for JAX☆64Updated last month
- ☆74Updated last week
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 4 years ago