ucbrise / caravelLinks
Studying GPU Multi-tenancy
☆12Updated 6 years ago
Alternatives and similar repositories for caravel
Users that are interested in caravel are comparing it to the libraries listed below
Sorting:
- Fault-tolerant for DL frameworks☆70Updated 2 years ago
- 碩士論文文獻筆記(Deep Learning、Scheduling、Distributed、Kubernetes)☆51Updated 6 years ago
- CS294-162; Machine Learning Systems Seminar☆31Updated 2 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆99Updated 3 years ago
- A Kubernetes operator for mxnet jobs☆53Updated 3 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- ☆18Updated 7 years ago
- Paper Reading:涉及分布式、虚拟化、网络、机器学习☆23Updated 5 years ago
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆160Updated 6 years ago
- Elastic Serverless Serving based on Kubernetes, provides 0 instance serving capability.☆10Updated 3 years ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- ☆13Updated 6 years ago
- An Efficient Dynamic Resource Scheduler for Deep Learning Clusters☆42Updated 8 years ago
- Kernel for Kubeflow in Jupyter Notebook☆66Updated 6 years ago
- NVIDIA device plugin for Kubernetes☆15Updated 6 years ago
- DevComm-Shanghai Weekly 上海地区高校技术社团联合周报(欢迎投稿)☆66Updated 3 months ago
- Model factory is a ML training platform to help engineers to build ML models at scale☆18Updated 4 years ago
- Kubernetes Scheduler for Deep Learning☆262Updated 3 years ago
- SCV is a distributed cluster GPU sniffer. SCV是一个分布式GPU嗅探器☆21Updated 2 years ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆44Updated 4 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Updated 2 years ago
- Elastic Deep Learning for deep learning framework on Kubernetes☆174Updated 2 years ago
- Fine-grained GPU sharing primitives☆146Updated 3 months ago
- This repository contains statistics about the AI Infrastructure products.☆17Updated 8 months ago
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆65Updated last year
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆35Updated last year
- Serverless ML Framework☆106Updated 3 years ago
- 滴滴云推理服务的 HTTP 客户端示例代码☆21Updated 2 years ago
- Building Machine Learning Infrastructure!☆45Updated 6 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆28Updated 6 years ago