stephen-chung-mh / thinker
Thinker project
☆12Updated 4 months ago
Alternatives and similar repositories for thinker:
Users that are interested in thinker are comparing it to the libraries listed below
- An Open-Ended Agentic Simulator☆36Updated 5 months ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆12Updated 6 months ago
- ☆34Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 9 months ago
- POPGym Library in JAX☆11Updated 9 months ago
- General Modules for JAX☆62Updated 5 months ago
- ☆29Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Code for magnetic mirror descent.☆15Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 9 months ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- ☆15Updated 8 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- ☆45Updated last year
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆19Updated 3 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆14Updated 8 months ago
- ☆40Updated last year
- Conservative Q learning in Jax☆52Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 7 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 6 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 2 months ago
- A high-performance reinforcement learning library in jax specialized for robotic learning☆22Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆18Updated last year
- Standalone library of frequently-used wrappers for dm_env environments.☆18Updated 6 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- Corax: Core RL in JAX☆36Updated 10 months ago
- ☆28Updated 3 years ago
- ☆31Updated 10 months ago