openpsi-project / srl
A Really Scalable RL Framework to 10k+ CPUs
☆33 Updated last year
Alternatives and similar repositories for srl
Users that are interested in srl are comparing it to the libraries listed below
- A distributed GPU-centric experience replay system for large AI models. ☆18 Updated 2 years ago
- Launch programs on multiple hosts. ☆14 Updated 2 years ago
- A high-performance, scalable MindSpore reinforcement learning framework. ☆50 Updated last year
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores ☆15 Updated last year
- ☆31 Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); football game agent ☆61 Updated last year
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads ☆44 Updated 4 years ago
- Distributed DRL by Ray and TensorFlow Tutorial. ☆10 Updated 5 years ago
- A Massively Parallel Large Scale Self-Play Framework ☆351 Updated 2 years ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆309 Updated 4 months ago
- A simple 2D ball collision engine. ☆12 Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models ☆48 Updated 5 months ago
- A large-scale multi-modal pre-trained model ☆132 Updated 2 years ago
- ☆82 Updated 2 years ago
- ☆25 Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically ☆71 Updated 2 years ago
- ☆12 Updated 3 years ago
- A collection of LLM with RL papers ☆277 Updated last year
- RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforce… ☆49 Updated last week
- Keeping track of RL experiments ☆163 Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project ☆50 Updated last year
- Online Decision Transformer ☆266 Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆40 Updated 9 months ago
- [NeurIPS 2022] 1st Place Solution for the 3rd Neural MMO Challenge ☆30 Updated 2 years ago
- ☆89 Updated 2 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluate a large set of candidate policies from a fixed dataset and select the best ones. ☆11 Updated 3 years ago
- ☆236 Updated 9 months ago
- A set of competitive environments for Reinforcement Learning research. ☆29 Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆20 Updated 5 months ago