stephen-chung-mh / thinker
Thinker project
☆14Updated 6 months ago
Alternatives and similar repositories for thinker:
Users that are interested in thinker are comparing it to the libraries listed below
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆45Updated 9 months ago
- ☆44Updated last year
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆20Updated 4 months ago
- Code for magnetic mirror descent.☆16Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 10 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Python library for easily making web Apps to compare humans and AI☆16Updated last month
- ☆35Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- ☆16Updated 11 months ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 11 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 11 months ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated last year
- PAIRED in PyTorch 🔥☆58Updated 2 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 5 months ago
- ☆30Updated 4 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆12Updated 8 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- General Modules for JAX☆64Updated last month
- ☆18Updated 4 years ago
- Conservative Q learning in Jax☆53Updated 2 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18Updated 2 years ago
- ☆47Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- POPGym Library in JAX☆11Updated 11 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ☆17Updated 2 years ago