CentML / DeepView.PredictLinks
๐ฎ Execution time predictions for deep neural network training iterations across different GPUs.
โ14Updated last year
Alternatives and similar repositories for DeepView.Predict
Users that are interested in DeepView.Predict are comparing it to the libraries listed below
Sorting:
- โ38Updated 7 months ago
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clustersโ20Updated 2 years ago
- Artifacts for our NSDI'23 paper TGSโ95Updated last year
- โ24Updated 2 years ago
- โ84Updated 3 years ago
- Thunder Research Group's Collective Communication Libraryโ47Updated 6 months ago
- An interference-aware scheduler for fine-grained GPU sharingโ159Updated 2 months ago
- Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020โ137Updated last year
- Helios Traces from SenseTimeโ61Updated 3 years ago
- โ52Updated 3 years ago
- ๐ฎ Execution time predictions for deep neural network training iterations across different GPUs.โ63Updated 3 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlowโ55Updated last year
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.โ55Updated 3 years ago
- Compiler for Dynamic Neural Networksโ45Updated 2 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU scheโฆโ104Updated 3 years ago
- Paella: Low-latency Model Serving with Virtualized GPU Schedulingโ67Updated last year
- โ53Updated last year
- Synthesizer for optimal collective communication algorithmsโ124Updated last year
- Artifacts for our SIGCOMM'22 paper Muriโ43Updated 2 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)โ93Updated 2 years ago
- NCCL Profiling Kitโ150Updated last year
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systemsโ236Updated 2 weeks ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusionโ32Updated last year
- Repository for MLCommons Chakra schema and toolsโ153Updated 3 months ago
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobsโ58Updated 2 years ago
- Model-less Inference Servingโ93Updated 2 years ago
- โ44Updated last year
- Microsoft Collective Communication Libraryโ381Updated 2 years ago
- โ23Updated 2 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketchesโ80Updated 2 years ago