sven1977 / dreamer_v3Links

Implementation (TensorFlow/keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023

☆3

Alternatives and similar repositories for dreamer_v3

Users that are interested in dreamer_v3 are comparing it to the libraries listed below

Sorting:

51616 / marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19Updated last year
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆17Updated 3 years ago
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23Updated last year
nmonette / NCC-UED
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆13Updated 3 weeks ago
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆59Updated 10 months ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28Updated 3 years ago
micahcarroll / uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
☆56Updated last year
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆28Updated 11 months ago
SAIC-MONTREAL / hyperzero
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆20Updated 2 years ago
clvrai / create
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18Updated 2 years ago
sdpkjc / abcdrl
Modular Single-file Reinfocement Learning Algorithms Library
☆38Updated 2 years ago
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆67Updated 3 years ago
jacooba / hyper
Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …
☆15Updated last year
sail-sg / offbench
☆15Updated 2 years ago
aliang8 / varibad_jax
☆10Updated last year
brownirl / lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆18Updated 9 months ago
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆27Updated 2 years ago
proceduralia / high_replay_ratio_continuous_control
Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"
☆26Updated 2 years ago
sail-sg / rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆29Updated 2 years ago
RyanNavillus / reward-surfaces
☆17Updated last year
ikostrikov / jaxrl2
☆47Updated 2 years ago
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆17Updated 6 months ago
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87Updated 2 years ago
architsharma97 / earl_benchmark
EARL: Environment for Autonomous Reinforcement Learning
☆37Updated 2 years ago
tinkoff-ai / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆55Updated 2 years ago
scottemmons / rvs
Reinforcement Learning via Supervised Learning
☆71Updated 3 years ago
qgallouedec / lge
☆31Updated last year
qlan3 / Jaxplorer
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
☆13Updated last year
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆23Updated last year
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57Updated last year