openpsi-project / srl
A Really Scalable RL Framework to 10k+ CPUs
☆38Updated last year
Alternatives and similar repositories for srl
Users that are interested in srl are comparing it to the libraries listed below
- A high-performance, scalable MindSpore reinforcement learning framework.☆51Updated last year
- A distributed GPU-centric experience replay system for large AI models.☆19Updated 2 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆328Updated 7 months ago
- Launch programs on multiple hosts.☆14Updated 2 years ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆92Updated 3 months ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆46Updated 4 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Updated 5 years ago
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems☆309Updated this week
- ☆33Updated 2 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆50Updated 8 months ago
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- A simple 2D ball collision engine.☆12Updated 2 years ago
- A collection of LLM with RL papers☆279Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆118Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); a football game agent☆62Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Updated 8 months ago
- Keeping track of RL experiments☆165Updated 3 years ago
- A Massively Parallel Large Scale Self-Play Framework☆358Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- Online Decision Transformer☆273Updated last year
- ☆88Updated 2 years ago
- A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from trainin…☆109Updated last week
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆172Updated this week
- Implementation of FP8/INT8 rollout for RL training without a performance drop.☆280Updated last month
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆123Updated 4 years ago
- ☆244Updated last year
- A set of competitive environments for Reinforcement Learning research.☆29Updated 3 years ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆66Updated last year
- ☆91Updated 3 years ago