volcengine / veTurboIOLinks
A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files.
☆22Updated 8 months ago
Alternatives and similar repositories for veTurboIO
Users that are interested in veTurboIO are comparing it to the libraries listed below
Sorting:
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆267Updated 2 years ago
- ☆58Updated 5 years ago
- Efficient and easy multi-instance LLM serving☆475Updated 2 weeks ago
- GLake: optimizing GPU memory management and IO transmission.☆478Updated 5 months ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆105Updated 3 months ago
- NVIDIA Inference Xfer Library (NIXL)☆587Updated this week
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆121Updated last year
- Automatic tuning for ML model deployment on Kubernetes☆81Updated 10 months ago
- PyTorch distributed training acceleration framework☆52Updated 3 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆110Updated last week
- GPU-scheduler-for-deep-learning☆210Updated 4 years ago
- ☆288Updated last week
- KV cache store for distributed LLM inference☆321Updated 2 months ago
- Toolchain built around the Megatron-LM for Distributed Training☆61Updated 3 weeks ago
- Forked form☆11Updated 4 years ago
- Offline optimization of your disaggregated Dynamo graph☆54Updated this week
- ☆220Updated 2 years ago
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆201Updated last week
- A low-latency & high-throughput serving engine for LLMs☆409Updated 3 months ago
- Fine-grained GPU sharing primitives☆143Updated last month
- ☆47Updated 8 months ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆128Updated last year
- Fault-tolerant for DL frameworks☆70Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆88Updated 7 months ago
- Fast and memory-efficient exact attention☆91Updated last week
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆95Updated 2 years ago
- The driver for LMCache core to run in vLLM☆48Updated 7 months ago
- Elastic Deep Learning for deep learning framework on Kubernetes☆174Updated 2 years ago
- Kubernetes Scheduler for Deep Learning☆262Updated 3 years ago
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆127Updated 3 years ago