softsys4ai / unicorn
A Framework for Reasoning about System Performance using Causal AI
☆42Updated 2 years ago
Alternatives and similar repositories for unicorn:
Users that are interested in unicorn are comparing it to the libraries listed below
- How much energy do GenAI models consume?☆41Updated 3 months ago
- Machine learning on serverless platform☆8Updated 2 years ago
- Surrogate-based Hyperparameter Tuning System☆28Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- ☆43Updated 3 years ago
- ☆18Updated 3 years ago
- Modyn is a research-platform for training ML models on growing datasets.☆38Updated this week
- This repository contains code for the paper: Bergsma S., Zeyl T., Senderovich A., and Beck J. C., "Generating Complex, Realistic Cloud Wo…☆42Updated 3 years ago
- Privacy Budget Orchestration in Machine Learning Workloads (OSDI '21)☆24Updated last year
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Updated 4 years ago
- ☆21Updated 3 years ago
- Implementation of algorithms for memory optimized deep neural network training☆9Updated 4 years ago
- Carbon Explorer helps evaluating solutions make datacenters operate on renewable energy.☆69Updated 2 months ago
- Dynamic resources changes for multi-dimensional parallelism training☆21Updated 2 months ago
- Serverless for all computation☆41Updated last year
- A resilient distributed training framework☆88Updated 9 months ago
- Distributed tracing data from Meta's microservices architecture.☆19Updated last year
- a deep learning-driven scheduler for elastic training in deep learning clusters☆28Updated 4 years ago
- ☆14Updated 5 months ago
- The source code of INFless,a native serverless platform for AI inference.☆36Updated 2 years ago
- Learning-Based Coded Computation☆46Updated 2 years ago
- Major CS conference publication stats (including accepted and submitted) by year.☆117Updated 3 weeks ago
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters☆17Updated last year
- [SIGCOMM 2021] ARROW: Restoration-Aware Traffic Engineering☆14Updated 3 years ago
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆77Updated 3 years ago
- ☆12Updated 8 months ago
- ☆14Updated last year
- Main repository of the BeFaaS project☆14Updated last year
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆42Updated 3 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆53Updated 8 months ago