lucidrains / improving-transformers-world-model-for-rlLinks

Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch

☆149

Alternatives and similar repositories for improving-transformers-world-model-for-rl

Users that are interested in improving-transformers-world-model-for-rl are comparing it to the libraries listed below

Sorting:

lucidrains / evolutionary-policy-optimization
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University
☆103Updated 3 months ago
lucidrains / SAC-pytorch
Implementation of Soft Actor Critic and some of its improvements in Pytorch
☆64Updated 2 weeks ago
vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆115Updated last year
RyanNavillus / Syllabus
Synchronized Curriculum Learning for RL Agents
☆118Updated 2 months ago
lucidrains / x-transformers-rl
Implementation of a transformer for reinforcement learning using `x-transformers`
☆72Updated 3 months ago
lucidrains / scaling-vin-pytorch
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
☆37Updated last year
wang-kevin3290 / scaling-crl
☆231Updated last month
lucidrains / dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work
☆156Updated last week
vladisai / PLDM
☆51Updated 2 months ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆79Updated last year
rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆35Updated 6 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆130Updated 6 months ago
yilundu / ired_code_release
☆82Updated last year
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆112Updated 2 years ago
DAVIAN-Robotics / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆82Updated 2 months ago
lucidrains / RL-100
Implementation of RL-100, Performant Robotic Manipulation with Real-World Reinforcement Learning
☆54Updated last month
facebookresearch / mtm
MTM Masked Trajectory Models for Prediction, Representation, and Control.
☆162Updated last month
FLAIROx / jafar
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
☆97Updated 11 months ago
lucidrains / ppo
An implementation of PPO in Pytorch
☆105Updated last week
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆122Updated last month
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆35Updated 6 months ago
mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆272Updated 9 months ago
burchim / TWISTER
[ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)
☆45Updated 10 months ago
marc-rigter / polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
☆73Updated last year
NanboLi / FACTS
[ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling"
☆28Updated 7 months ago
lucidrains / vit-arc-slot
Explorations into improving ViTArc with Slot Attention
☆43Updated last year
weipu-zhang / STORM
☆121Updated last month
kvfrans / splus
☆122Updated 7 months ago
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆22Updated last year
seohongpark / fql
The official implementation of flow Q-learning (FQL)
☆270Updated 5 months ago