coreweave / ml-containers
☆30Updated last week
Alternatives and similar repositories for ml-containers:
Users that are interested in ml-containers are comparing it to the libraries listed below
- Module, Model, and Tensor Serialization/Deserialization☆225Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆126Updated this week
- A top-like tool for monitoring GPUs in a cluster☆86Updated last year
- ☆205Updated last month
- ☆50Updated 5 months ago
- Pygloo provides Python bindings for Gloo.☆22Updated 2 months ago
- NVIDIA NCCL Tests for Distributed Training☆88Updated last week
- A collection of reproducible inference engine benchmarks☆29Updated 2 weeks ago
- Benchmark suite for LLMs from Fireworks.ai☆70Updated 2 months ago
- ☆53Updated 7 months ago
- ☆304Updated 8 months ago
- The Triton backend for the PyTorch TorchScript models.☆148Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆94Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.☆26Updated this week
- The Triton backend for TensorFlow.☆51Updated 3 weeks ago
- Distributed ML Optimizer☆32Updated 3 years ago
- The driver for LMCache core to run in vLLM☆38Updated 3 months ago
- ☆68Updated last month
- Example ML projects that use the Determined library.☆32Updated 7 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆33Updated last month
- OpenVINO backend for Triton.☆31Updated 2 weeks ago
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆110Updated this week
- Fast and memory-efficient exact attention☆68Updated last week
- High-performance safetensors model loader☆25Updated 3 weeks ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆65Updated last year
- CUDA checkpoint and restore utility☆330Updated 3 months ago
- Repository for open inference protocol specification☆54Updated 9 months ago
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…☆32Updated last year
- pytorch code examples for measuring the performance of collective communication calls in AI workloads☆16Updated 6 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆9Updated this week