sven1977 / dreamer_v3Links
Implementation (TensorFlow/keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023
☆3Updated 2 years ago
Alternatives and similar repositories for dreamer_v3
Users that are interested in dreamer_v3 are comparing it to the libraries listed below
Sorting:
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆13Updated 3 weeks ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 10 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆56Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 11 months ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆20Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 3 years ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆15Updated last year
- ☆15Updated 2 years ago
- ☆10Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 9 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆26Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆29Updated 2 years ago
- ☆17Updated last year
- ☆47Updated 2 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆17Updated 6 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆55Updated 2 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 3 years ago
- ☆31Updated last year
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆23Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year