Azure / MoneoLinks
Distributed AI/HPC Monitoring Framework
☆28Updated 8 months ago
Alternatives and similar repositories for Moneo
Users that are interested in Moneo are comparing it to the libraries listed below
Sorting:
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆122Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆154Updated last week
- RDMA and SHARP plugins for nccl library☆217Updated last month
- NCCL Profiling Kit☆149Updated last year
- ☆47Updated last year
- ☆73Updated 11 months ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆66Updated last year
- ☆16Updated last year
- Fine-grained GPU sharing primitives☆147Updated 4 months ago
- Microsoft Collective Communication Library☆66Updated last year
- Aims to implement dual-port and multi-qp solutions in deepEP ibrc transport☆70Updated 7 months ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆57Updated last year
- Efficient Compute-Communication Overlap for Distributed LLM Inference☆66Updated last month
- Fast OS-level support for GPU checkpoint and restore☆261Updated 2 months ago
- SOTA Learning-augmented Systems☆37Updated 3 years ago
- oneCCL Bindings for Pytorch* (deprecated)☆102Updated last month
- Issues related to MLPerf® Inference policies, including rules and suggested changes☆64Updated last month
- ☆44Updated 4 years ago
- ☆56Updated 4 years ago
- A GPU-driven system framework for scalable AI applications☆123Updated 10 months ago
- ☆83Updated 6 months ago
- Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“☆64Updated last year
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆58Updated 5 months ago
- ☆70Updated 3 months ago
- Multi-Instance-GPU profiling tool☆58Updated 2 years ago
- Model-less Inference Serving☆92Updated 2 years ago
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node☆60Updated this week
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆285Updated 4 months ago
- pytorch ucc plugin☆23Updated 4 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆63Updated 3 years ago