openpsi-projects / srl
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
☆13 Updated 6 months ago
Related projects
Alternatives and complementary repositories for srl
- A Really Scalable RL Framework to 10k+ CPUs ☆18 Updated 8 months ago
- Paper Collection for Batch RL with brief introductions. ☆85 Updated 2 years ago
- ☆86 Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆43 Updated last year
- A minimal and stable PPO. ☆123 Updated 9 months ago
- Official code repository for Prompt-DT. ☆98 Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms. ☆65 Updated 6 months ago
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆78 Updated 3 months ago
- Implementation of BC-IRL and other IRL baselines ☆25 Updated last year
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023 ☆51 Updated 2 weeks ago
- Transformer-based World Models ☆71 Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 Updated last month
- Repo for Implicit Diffusion Q-Learning ☆93 Updated 11 months ago
- OGBench: Benchmarking Offline Goal-Conditioned RL ☆79 Updated 3 weeks ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation ☆62 Updated last year
- A simple and scalable agent for training adaptive policies with sequence-based RL ☆92 Updated this week
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291. ☆98 Updated last year
- Baselines for Neural MMO -- new users should treat this repo as a starter project ☆46 Updated 3 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 3 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆80 Updated last year
- Author's PyTorch implementation of our ICLR 2024 paper "Uni-O4" ☆37 Updated last week
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022 ☆44 Updated 8 months ago
- ☆62 Updated 5 months ago
- ☆235 Updated 2 years ago
- Corax: Core RL in JAX