Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆74 · Aug 31, 2024 · Updated last year
Alternatives and similar repositories for powderworld
Users who are interested in powderworld are comparing it to the libraries listed below.
- Standalone library of frequently-used wrappers for dm_env environments. ☆19 · Jul 9, 2024 · Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ☆171 · Jun 23, 2023 · Updated 2 years ago
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022 ☆28 · Jul 10, 2022 · Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆38 · May 16, 2023 · Updated 2 years ago
- ☆13 · Aug 9, 2022 · Updated 3 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Oct 12, 2023 · Updated 2 years ago
- OpenAI Gym environment wrapper to vectorize environments with Ray ☆23 · May 25, 2023 · Updated 2 years ago
- Accelerated replay buffers in JAX ☆46 · Sep 17, 2022 · Updated 3 years ago
- A high-performance reinforcement learning library in JAX specialized for robotic learning ☆22 · Sep 4, 2023 · Updated 2 years ago
- ☆32 · Mar 19, 2024 · Updated 2 years ago
- Deep Hierarchical Planning from Pixels ☆118 · Dec 21, 2022 · Updated 3 years ago
- ☆19 · Mar 1, 2023 · Updated 3 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC ☆62 · Aug 3, 2023 · Updated 2 years ago
- Code for the paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains ☆10 · Nov 12, 2021 · Updated 4 years ago
- Learning Robust Dynamics Through Variational Sparse Gating ☆20 · Oct 19, 2022 · Updated 3 years ago
- A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment. ☆14 · Jan 8, 2022 · Updated 4 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆28 · May 22, 2023 · Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆106 · May 17, 2022 · Updated 3 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆13 · Jul 11, 2022 · Updated 3 years ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆79 · Jun 4, 2024 · Updated last year
- ☆117 · Apr 28, 2023 · Updated 2 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM) ☆78 · Jun 23, 2023 · Updated 2 years ago
- ☆19 · Nov 25, 2022 · Updated 3 years ago
- Model-based reinforcement learning (generative simulator models and planning agents) ☆16 · Mar 13, 2026 · Updated last month
- A toolkit for practical Human-AI cooperation research ☆14 · Apr 19, 2024 · Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ☆99 · Jul 5, 2023 · Updated 2 years ago
- ☆35 · Jan 4, 2023 · Updated 3 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24 ☆12 · Apr 8, 2024 · Updated 2 years ago
- Atari-style POMDPs ☆27 · Updated this week
- Challenging Memory-based Deep Reinforcement Learning Agents ☆111 · Oct 27, 2024 · Updated last year
- ☆26 · Apr 26, 2024 · Updated last year
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss… ☆35 · Nov 17, 2023 · Updated 2 years ago
- ☆259 · Mar 11, 2026 · Updated last month
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research ☆193 · Apr 3, 2026 · Updated last week
- Adaptable Agent Populations via a Generative Model of Policies ☆12 · Oct 14, 2021 · Updated 4 years ago
- ☆58 · Sep 22, 2022 · Updated 3 years ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024 ☆11 · Jun 6, 2024 · Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch ☆57 · May 21, 2023 · Updated 2 years ago