mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆225Updated this week
Alternatives and similar repositories for streaming-drl:
Users that are interested in streaming-drl are comparing it to the libraries listed below
- Implementation of Dreamer v3 in pytorch.☆511Updated 5 months ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆72Updated 7 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆170Updated 9 months ago
- Multi-Agent Reinforcement Learning with JAX☆541Updated 2 weeks ago
- ☆257Updated 3 years ago
- ☆217Updated 4 months ago
- ☆79Updated 9 months ago
- A collection of MARL benchmarks based on TorchRL☆366Updated this week
- ☆268Updated 2 years ago
- ☆78Updated 3 weeks ago
- A benchmark for offline goal-conditioned RL and offline RL☆139Updated 3 weeks ago
- Online Decision Transformer☆251Updated last year
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆475Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆149Updated this week
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆264Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆138Updated last year
- ☆333Updated last year
- ☆70Updated last year
- A framework for Reinforcement Learning research.☆142Updated last month
- Transformer-based World Models☆78Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆294Updated last month
- Repo for Implicit Diffusion Q-Learning☆104Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆114Updated last month
- PyTorch implementation of DreamerV2 model-based RL algorithm☆216Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆90Updated 7 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆349Updated 3 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆313Updated 7 months ago