A Really Scalable RL Framework to 10k+ CPUs
☆38Feb 29, 2024Updated 2 years ago
Alternatives and similar repositories for srl
Users that are interested in srl are comparing it to the libraries listed below
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆331Apr 24, 2025Updated 10 months ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated last year
- Official gym API for game FightingICE.☆15Feb 17, 2024Updated 2 years ago
- ☆27Jan 7, 2025Updated last year
- Official implementation of TBA for async LLM post-training.☆29Nov 5, 2025Updated 3 months ago
- Nex Venus Communication Library☆72Nov 17, 2025Updated 3 months ago
- Transformers (GTrXL & CoBERL) applied to RL tasks☆30Aug 18, 2022Updated 3 years ago
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated last month
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆557Nov 26, 2025Updated 3 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- Protocol buffers and other common resources.☆13Jan 20, 2026Updated last month
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 4 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data☆10Aug 12, 2022Updated 3 years ago
- Code for an agentic RAG method with a dynamic workflow.☆12Jan 22, 2026Updated last month
- [NeurIPS 2025] CodeCrash: Exposing LLM Fragility to Misleading Natural Language in Code Reasoning☆16Jan 24, 2026Updated last month
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆40Feb 29, 2024Updated 2 years ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆91Feb 23, 2026Updated last week
- Accepted to MLSys 2026☆70Feb 22, 2026Updated last week
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- Cryptographically Secure Aggregation for Federated Learning☆11Jan 24, 2023Updated 3 years ago
- read source code of boltdb & re-implement it in c++☆12Jun 2, 2018Updated 7 years ago
- Automatic ReLU Reduction☆15Dec 20, 2023Updated 2 years ago
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation☆22May 29, 2025Updated 9 months ago
- ☆11Sep 12, 2023Updated 2 years ago
- SJTU SE3331 CSE (a distributed file system with Raft and MapReduce)☆10Jan 14, 2024Updated 2 years ago
- ☆13Jan 7, 2025Updated last year
- a simple API to use CUPTI☆11Aug 19, 2025Updated 6 months ago
- My notes for reading leveldb☆11Apr 19, 2024Updated last year
- Coroutines and a scheduler implemented from scratch with Boost.Context, used to build an RPC framework☆10May 9, 2025Updated 9 months ago
- paper and code for New Directions in Cloud Programming, CIDR 2021☆11Feb 17, 2021Updated 5 years ago
- A compiled list of resources and materials for PPML☆11May 10, 2025Updated 9 months ago
- ACM Class 2017 Computer Architecture☆10Jan 11, 2018Updated 8 years ago
- Applying PBT optimization technique to different domains☆10Oct 16, 2019Updated 6 years ago
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.☆11Jun 26, 2021Updated 4 years ago
- TFLite python API package for parsing TFLite model☆12Jan 20, 2020Updated 6 years ago