inclusionAI / asystem-awexLinks
A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows
☆118Updated 2 weeks ago
Alternatives and similar repositories for asystem-awex
Users that are interested in asystem-awex are comparing it to the libraries listed below
Sorting:
- ByteCheckpoint: An Unified Checkpointing Library for LFMs☆256Updated last month
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆250Updated 3 weeks ago
- Toolchain built around the Megatron-LM for Distributed Training☆81Updated last month
- An early research stage expert-parallel load balancer for MoE models based on linear programming.☆481Updated last month
- The driver for LMCache core to run in vLLM☆59Updated 11 months ago
- torchcomms: a modern PyTorch communications API☆315Updated last week
- KV cache store for distributed LLM inference☆378Updated last month