openpsi-project/srl

openpsi-project / srl

A Really Scalable RL Framework to 10k+ CPUs

☆38

Alternatives and similar repositories for srl

Users who are interested in srl are comparing it to the libraries listed below.

openpsi-projects / srl
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
☆15 · Apr 24, 2024 · Updated 2 years ago
openpsi-project / ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆335 · Apr 24, 2025 · Updated last year
Aladoro / Stabilizing-Off-Policy-RL
☆18 · Aug 3, 2022 · Updated 3 years ago
TeamFightingICE / Gym-FightingICE
Official gym API for the game FightingICE.
☆15 · Feb 17, 2024 · Updated 2 years ago
bbartoldson / TBA
Official implementation of TBA for async LLM post-training.
☆31 · Nov 5, 2025 · Updated 8 months ago
Terra-Flux / PolyRL
[NSDI'26] PolyRL is a reinforcement learning framework for LLMs that harvests spot instances in the cloud to reduce cost.
☆19 · Mar 30, 2026 · Updated 3 months ago
RodkinIvan / Transformer-RL
Transformers (GTrXL & CoBERL) applied to RL tasks
☆29 · Aug 18, 2022 · Updated 3 years ago
TARTRL / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). A football game agent.
☆14 · May 25, 2023 · Updated 3 years ago
Notify-ctrl / motalang
臸娥粂陆亩竟
☆10 · May 11, 2024 · Updated 2 years ago
TonyTangYu / delta-examples
☆12 · Apr 30, 2024 · Updated 2 years ago
tianpeiyang / MAPTF_code
Source code for the paper "An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning"
☆24 · Sep 2, 2022 · Updated 3 years ago
chhzh123 / ptc-tutorial
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17 · Mar 13, 2023 · Updated 3 years ago
shacklettbp / madrona
☆507 · Nov 3, 2025 · Updated 8 months ago
vahide-b-84 / FaultTolerantTaskOffloadingSimulation
This project proposes a DRL-based fault-tolerant task offloading method for Mobile Edge-Cloud Computing. Using a DDPG algorithm, it minim…
☆22 · Apr 28, 2026 · Updated 2 months ago
GusLovesMath / Llama3_MacSilicon
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…
☆11 · May 4, 2024 · Updated 2 years ago
joenghl / HYPO
☆14 · Dec 29, 2023 · Updated 2 years ago
floringogianu / snrl
Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.
☆11 · Jun 26, 2021 · Updated 5 years ago
zero-li / ZgoCompose
Android Compose catalog
☆17 · Jul 4, 2025 · Updated last year
StephanXu / Ossian
Ossian generic framework
☆12 · Aug 25, 2021 · Updated 4 years ago
semitable / seps
Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)
☆10 · Oct 26, 2021 · Updated 4 years ago
AlvinWen428 / keyframe-focused-imitation-learning
☆11 · Dec 13, 2021 · Updated 4 years ago
guillaumebort / mison
Scala Mison implementation
☆15 · Nov 16, 2018 · Updated 7 years ago
cioc / ray-kubernetes
Ray Framework (https://github.com/ray-project/ray) on Kubernetes
☆13 · Oct 12, 2018 · Updated 7 years ago
rottaca / FallDetectionProject
This is the code repository for a project at Ulm University. It's a fall detection system based on address-event-based cameras.
☆11 · Sep 29, 2017 · Updated 8 years ago
MichaelBeechan / Learning_TensorFlow-Kaggle_MNIST
A step-by-step introduction to TensorFlow and neural networks through a project (MNIST handwritten digit recognition)
☆11 · Oct 5, 2018 · Updated 7 years ago
elsheikh21 / population-based-training-of-NNs
Applying the PBT optimization technique to different domains
☆10 · Oct 16, 2019 · Updated 6 years ago
kzl / aop
Official codebase for Adaptive Online Planning for Continual Lifelong Learning.
☆17 · Mar 26, 2020 · Updated 6 years ago
Hsword / Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …
☆126 · Dec 18, 2023 · Updated 2 years ago
SimondeMoreau / LED
LED: Light Enhanced Depth Estimation at Night
☆15 · Mar 24, 2026 · Updated 4 months ago
vlad17 / mve
MVE: model-based value estimation
☆11 · Jul 30, 2018 · Updated 7 years ago
divyahansg / RecurrentDPG
CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)
☆10 · Jun 10, 2017 · Updated 9 years ago
NUS-HPC-AI-Lab / pytorch-lamb
PyTorch implementation of LAMB for ImageNet/ResNet-50 training
☆13 · May 13, 2021 · Updated 5 years ago
Gladys-Zhao / mRNN-mLSTM
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17 · Jan 6, 2021 · Updated 5 years ago
HumanCompatibleAI / overcooked-demo
Web application where humans can play Overcooked with AI agents.
☆60 · Dec 6, 2022 · Updated 3 years ago
realstolz / powerlyra
Differentiated Computation and Partitioning on Skewed (Natural or Bipartite) Graphs
☆67 · Mar 30, 2022 · Updated 4 years ago
RyanNavillus / PPO-v3
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16 · May 19, 2023 · Updated 3 years ago
IrisLi17 / bridge_construction
☆14 · Sep 29, 2021 · Updated 4 years ago
Xubbbb / chfs
SJTU SE3331 CSE (a distributed file system with Raft and MapReduce)
☆10 · Jan 14, 2024 · Updated 2 years ago
inclusionAI / ASearcher
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆602 · Nov 26, 2025 · Updated 8 months ago