bstadie / krazyworldView external linksLinks
krazy grid world
☆25Mar 2, 2020Updated 5 years ago
Alternatives and similar repositories for krazyworld
Users that are interested in krazyworld are comparing it to the libraries listed below
Sorting:
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆17Jun 29, 2025Updated 7 months ago
- Simple JAX Graphics Library.☆36Nov 3, 2024Updated last year
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆35Oct 24, 2025Updated 3 months ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- video prediction and world model research☆14Jun 10, 2022Updated 3 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- ☆16Aug 7, 2021Updated 4 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆69Jun 5, 2020Updated 5 years ago
- Behavioural cloning solution to MineRL2020 competition☆18Mar 6, 2021Updated 4 years ago
- ☆17Aug 3, 2022Updated 3 years ago
- ☆19Mar 1, 2023Updated 2 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- A dataloader, but for JAX☆20May 17, 2024Updated last year
- Scripts to recreate the D4RL datasets with Minari☆25Jul 21, 2025Updated 6 months ago
- ☆19Jun 25, 2023Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆27May 22, 2023Updated 2 years ago
- Parallelizing non-linear sequential models over the sequence length☆56Jun 23, 2025Updated 7 months ago
- ☆19Nov 25, 2022Updated 3 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Apr 1, 2022Updated 3 years ago
- ☆23Aug 19, 2022Updated 3 years ago
- An implementation of MuZero in JAX.☆57Nov 8, 2022Updated 3 years ago
- ☆29May 21, 2025Updated 8 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- Code and links for over 25,000 trained Atari agents☆98Aug 22, 2024Updated last year
- ☆28Jul 28, 2022Updated 3 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆34Oct 28, 2025Updated 3 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Aug 22, 2024Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆67Jan 18, 2024Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆371Jul 7, 2025Updated 7 months ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- ☆28Mar 13, 2019Updated 6 years ago
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆31Nov 23, 2021Updated 4 years ago
- Code for the publication Learning to Reason with Third-Order Tensor Products.☆41Jan 14, 2019Updated 7 years ago
- Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions☆28May 23, 2019Updated 6 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆35Nov 28, 2018Updated 7 years ago