ucbrise / caravel
Studying GPU Multi-tenancy
☆12Updated 6 years ago
Alternatives and similar repositories for caravel:
Users that are interested in caravel are comparing it to the libraries listed below
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- Fault-tolerant for DL frameworks☆70Updated last year
- 碩士論文文獻筆記(Deep Learning、Scheduling、Distributed、Kubernetes)☆50Updated 5 years ago
- An Efficient Dynamic Resource Scheduler for Deep Learning Clusters☆42Updated 7 years ago
- A Kubernetes operator for mxnet jobs☆53Updated 3 years ago
- SCV is a distributed cluster GPU sniffer. SCV是一个分布式GPU嗅探器☆21Updated 2 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆28Updated 5 years ago
- Model-less Inference Serving☆88Updated last year
- ☆82Updated 2 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Updated 2 years ago
- Fine-grained GPU sharing primitives☆141Updated 5 years ago
- Paper Reading:涉及分布式、虚拟化、网络、机器学习☆23Updated 4 years ago
- CS294-162; Machine Learning Systems Seminar☆31Updated 2 years ago
- Machine Learning System☆14Updated 4 years ago
- ☆11Updated 4 years ago
- Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation ov…☆58Updated 2 years ago
- Release doc/tutorial/wheels for poseidon-tf☆10Updated 7 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆52Updated 2 years ago
- ☆18Updated 7 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆98Updated 3 years ago
- CUPTI GPU Profiler☆37Updated 6 years ago
- Implementation for MIT 6.824 Distributed System 2016☆7Updated 8 years ago
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆64Updated 8 months ago
- My paper/code reading notes in Chinese☆46Updated 10 months ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆52Updated 7 months ago
- Repository for SysML19 Artifacts Evaluation☆53Updated 6 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Updated last year
- The schedule of the seminar☆25Updated 3 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆36Updated 5 years ago