openpsi-project / srl
A Really Scalable RL Framework to 10k+ CPUs
☆23Updated 11 months ago
Alternatives and similar repositories for srl:
Users who are interested in srl are comparing it to the libraries listed below.
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆13Updated 9 months ago
- A distributed GPU-centric experience replay system for large AI models.☆16Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆51Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆33Updated 2 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆83Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated 3 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆53Updated 6 months ago
- Launch programs on multiple hosts.☆14Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆68Updated last week
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- A high-performance, scalable MindSpore reinforcement learning framework.☆44Updated 7 months ago
- Learn online intrinsic rewards from LLM feedback☆34Updated last month
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆41Updated 6 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆46Updated 6 months ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆12Updated 3 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Official code repository for Prompt-DT.☆102Updated 2 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆12Updated last year
- Distributed DRL by Ray and TensorFlow Tutorial.☆9Updated 5 years ago
- JAX bindings for Flash Attention v2☆85Updated 6 months ago
- Code release for "Efficient Planning in a Compact Latent Action Space" (ICLR2023), https://arxiv.org/abs/2208.10291 ☆103Updated last year