facebookresearch / distributed_traces
Distributed tracing data from Meta's microservices architecture.
☆16Updated last year
Related projects: ⓘ
- Serverless for all computation☆41Updated last year
- Machine learning on serverless platform☆7Updated 2 years ago
- Lightning In-Memory Object Store☆44Updated 2 years ago
- Nightcore: Efficient and Scalable Serverless Computing for Latency-Sensitive, Interactive Microservices [ASPLOS '21]☆97Updated 3 years ago
- ☆45Updated 11 months ago
- ☆21Updated 3 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆148Updated 4 years ago
- An Efficient Dynamic Resource Scheduler for Deep Learning Clusters☆41Updated 6 years ago
- ☆41Updated 3 years ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆35Updated last week
- A curated list of awesome serverless research works, including papers and open-sourced projects.☆75Updated last year
- MeshInsight: Dissecting Overheads of Service Mesh Sidecars☆42Updated 8 months ago
- ☆67Updated last year
- A universal workflow system for exactly-once DAGs☆23Updated last year
- FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute (USENIX ATC'21)☆53Updated 2 years ago
- ☆39Updated last year
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 4 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆14Updated 3 years ago
- A benchmark suite for evaluating FaaS scheduler.☆22Updated last year
- ☆34Updated last year
- Fine-grained GPU sharing primitives☆139Updated 4 years ago
- Virtual Memory Abstraction for Serverless Architectures☆45Updated 2 years ago
- 🔋🎯 Thread-level, NUMA-aware energy attribution model for multi-tenancy☆52Updated last year
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆49Updated last month
- Resource Allocation for Dynamic Demands☆17Updated 8 months ago
- Surrogate-based Hyperparameter Tuning System☆26Updated last year
- This repository contains experimental tools we developed to forecast a clusters' resource (CPU or memory) usage.☆38Updated 3 years ago
- Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆124Updated last month
- ☆17Updated 3 years ago
- A Framework for Reasoning about System Performance using Causal AI☆41Updated 2 years ago