LMCache / LMBenchmark
☆15 Updated this week
Alternatives and similar repositories for LMBenchmark
Users that are interested in LMBenchmark are comparing it to the libraries listed below
- Cloud Native Benchmarking of Foundation Models ☆36 Updated 3 weeks ago
- A tool to detect infrastructure issues on cloud native AI systems ☆36 Updated 2 weeks ago
- Distributed KV cache coordinator ☆31 Updated 2 weeks ago
- A lightweight vLLM simulator for mocking out replicas. ☆19 Updated last week
- An efficient GPU resource sharing system with fine-grained control for Linux platforms. ☆83 Updated last year
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes ☆96 Updated 2 months ago
- A tool for coordinated checkpoint/restore of distributed applications with CRIU ☆22 Updated this week
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads. ☆30 Updated 5 months ago
- ☆30 Updated last month
- Inference scheduler for llm-d ☆47 Updated last week
- Intelligent platform for AI workloads ☆37 Updated 2 years ago
- RDMA CNI plugin for containerized workloads ☆52 Updated 3 weeks ago
- Golang library for managing resctrl filesystem ☆48 Updated 8 months ago
- Artifacts for our NSDI'23 paper TGS ☆75 Updated 11 months ago
- Selected Topics in Computer Networks @ Johns Hopkins University ☆19 Updated 4 years ago
- ☆42 Updated last year
- NVIDIA NCCL Tests for Distributed Training ☆92 Updated last week
- A simulator of Kubernetes for batch and service workloads. ☆46 Updated 4 years ago
- A storage plugin that provides CRI-O/Podman with the ability to lazy mount nydus images. ☆39 Updated 3 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆67 Updated last month
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA). ☆63 Updated 3 months ago
- GenAI inference performance benchmarking tool ☆45 Updated this week
- ☆31 Updated 3 years ago
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper ☆17 Updated 4 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances ☆121 Updated last year
- This is a fast RDMA abstraction layer that works both in the kernel and user-space. ☆56 Updated 6 months ago
- InfiniBand SR-IOV CNI ☆49 Updated this week
- ☆62 Updated last week
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23) ☆34 Updated last year
- A startup benchmarking tool for Docker containers. ☆70 Updated 9 years ago