coreweave / ml-containers
☆30Updated last week
Alternatives and similar repositories for ml-containers:
Users that are interested in ml-containers are comparing it to the libraries listed below
- Module, Model, and Tensor Serialization/Deserialization☆217Updated last month
- ☆170Updated last week
- A top-like tool for monitoring GPUs in a cluster☆86Updated last year
- vLLM adapter for a TGIS-compatible gRPC server.☆23Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆122Updated 3 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆84Updated last week
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated 3 months ago
- The Triton backend for the PyTorch TorchScript models.☆144Updated last week
- Repository for open inference protocol specification☆49Updated 8 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Updated 4 years ago
- ☆54Updated 6 months ago
- Cloud Native Benchmarking of Foundation Models☆24Updated 4 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆89Updated this week
- MLPerf™ logging library☆33Updated last week
- benchmarking some transformer deployments☆26Updated last year
- The driver for LMCache core to run in vLLM☆35Updated last month
- ☆62Updated 3 weeks ago
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆286Updated this week
- Distributed ML Optimizer☆30Updated 3 years ago
- The Triton backend for the ONNX Runtime.☆140Updated last week
- Benchmark suite for LLMs from Fireworks.ai☆69Updated last month
- ☆48Updated 4 months ago
- In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"☆26Updated 2 years ago
- Train, tune, and infer Bamba model☆86Updated 2 months ago
- CUDA checkpoint and restore utility☆306Updated last month
- ☆237Updated last week
- Pygloo provides Python bindings for Gloo.☆21Updated 3 weeks ago
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆99Updated this week
- OpenVINO backend for Triton.☆31Updated last week