microsoft / arkLinks
A GPU-driven system framework for scalable AI applications
☆116Updated 4 months ago
Alternatives and similar repositories for ark
Users that are interested in ark are comparing it to the libraries listed below
Sorting:
- Microsoft Collective Communication Library☆64Updated 7 months ago
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving☆42Updated last month
- An experimental parallel training platform☆54Updated last year
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆379Updated this week
- A lightweight design for computation-communication overlap.☆143Updated last week
- NCCL Profiling Kit☆138Updated 11 months ago
- Aims to implement dual-port and multi-qp solutions in deepEP ibrc transport☆50Updated last month
- Synthesizer for optimal collective communication algorithms☆108Updated last year
- Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“☆61Updated last year
- ☆79Updated 2 years ago
- ☆27Updated last year
- ☆25Updated 4 months ago
- Automated Parallelization System and Infrastructure for Multiple Ecosystems☆79Updated 7 months ago
- An extension library of WMMA API (Tensor Core API)☆99Updated 11 months ago
- ☆44Updated 3 weeks ago
- ☆37Updated 6 months ago
- ☆39Updated 5 months ago
- High performance Transformer implementation in C++.☆125Updated 5 months ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling