openpsi-project / srl
A Really Scalable RL Framework to 10k+ CPUs
☆33 Updated last year
Alternatives and similar repositories for srl
Users who are interested in srl are comparing it to the libraries listed below.
- A distributed GPU-centric experience replay system for large AI models. ☆18 Updated last year
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores ☆15 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆112 Updated 9 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆48 Updated 7 months ago
- Odysseus: Playground of LLM Sequence Parallelism ☆70 Updated 11 months ago
- ☆30 Updated 2 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads ☆44 Updated 4 years ago
- A high-performance, scalable MindSpore reinforcement learning framework. ☆48 Updated 11 months ago
- Distributed DRL by Ray and TensorFlow Tutorial. ☆10 Updated 5 years ago
- ☆18 Updated 6 years ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters. ☆38 Updated 2 years ago
- Automated Parallelization System and Infrastructure for Multiple Ecosystems ☆79 Updated 6 months ago
- Launch programs on multiple hosts. (Multi-machine program launcher) ☆14 Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023): football game agent ☆57 Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆299 Updated last month
- ☆49 Updated 2 weeks ago
- Estimate MFU for DeepSeekV3 ☆24 Updated 5 months ago
- ☆74 Updated 4 years ago
- Allow torch tensor memory to be released and resumed later ☆33 Updated this week
- ☆21 Updated last month
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆36 Updated 6 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆58 Updated 8 months ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm… ☆29 Updated 2 weeks ago
- ☆62 Updated 11 months ago
- Sequence-level 1F1B schedule for LLMs. ☆17 Updated last year
- Distributed ML Optimizer ☆32 Updated 3 years ago
- PyTorch bindings for CUTLASS grouped GEMM. ☆93 Updated last week
- An Attention Superoptimizer ☆21 Updated 4 months ago
- Python package for rematerialization-aware gradient checkpointing ☆24 Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆51 Updated 9 months ago