ucbrise / caravelLinks
Studying GPU Multi-tenancy
☆12Updated 6 years ago
Alternatives and similar repositories for caravel
Users that are interested in caravel are comparing it to the libraries listed below
Sorting:
- Fault-tolerant for DL frameworks☆70Updated 2 years ago
- A Kubernetes operator for mxnet jobs☆53Updated 3 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆99Updated 3 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- 碩士論文文獻筆記(Deep Learning、Scheduling、Distributed、Kubernetes)☆51Updated 6 years ago
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆159Updated 6 years ago
- CS294-162; Machine Learning Systems Seminar☆31Updated 2 years ago
- An Efficient Dynamic Resource Scheduler for Deep Learning Clusters☆42Updated 7 years ago
- Paper Reading:涉及分布式、虚拟化、网络、机器学习☆23Updated 5 years ago
- ☆18Updated 7 years ago
- Kubernetes Scheduler for Deep Learning☆262Updated 3 years ago
- ☆13Updated 6 years ago
- ☆83Updated 3 months ago
- Building Machine Learning Infrastructure!☆45Updated 6 years ago
- 滴滴云推理服务的 HTTP 客户端示例代码☆21Updated 2 years ago
- Kernel for Kubeflow in Jupyter Notebook☆66Updated 6 years ago
- DevComm-Shanghai Weekly 上海地区高校技术社团联合周报(欢迎投稿)☆66Updated 2 months ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- ☆16Updated 4 years ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆35Updated last year
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆65Updated last year
- High-performance key-value store☆12Updated 6 years ago
- Automatic tuning for ML model deployment on Kubernetes☆81Updated 11 months ago
- NVIDIA device plugin for Kubernetes☆15Updated 6 years ago
- Fine-grained GPU sharing primitives☆144Updated 2 months ago
- Elastic Serverless Serving based on Kubernetes, provides 0 instance serving capability.☆10Updated 3 years ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Updated last year
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆59Updated 3 years ago
- ☆58Updated 5 years ago