openpsi-project / srl
A Really Scalable RL Framework to 10k+ CPUs
☆18Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for srl
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆13Updated 6 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆49Updated last year
- A distributed GPU-centric experience replay system for large AI models.☆16Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated this week
- Distributed DRL by Ray and TensorFlow Tutorial.☆9Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- Benchmarked implementations of Offline RL Algorithms.☆65Updated 6 months ago
- ☆17Updated 5 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated last month
- A high-performance, scalable MindSpore reinforcement learning framework.☆41Updated 4 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- Accelerated replay buffers in JAX☆40Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- ☆22Updated 10 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆11Updated last year
- Implementation of the paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆13Updated last month
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆109Updated this week
- ☆34Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆31Updated 7 months ago
- Implementation of BC-IRL and other IRL baselines☆25Updated last year
- ☆28Updated last year
- ☆86Updated 2 years ago
- ☆18Updated 5 years ago
- Launch programs on multiple hosts. (多机启动程序)☆14Updated last year
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆11Updated last month
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Dateset Reset Policy Optimization☆28Updated 7 months ago