pkusys / Jolteon
Automatic resource configuration for serverless workflows.
☆20Updated last year
Alternatives and similar repositories for Jolteon:
Users that are interested in Jolteon are comparing it to the libraries listed below
- ☆40Updated 2 years ago
- The source code of INFless,a native serverless platform for AI inference.☆38Updated 2 years ago
- ☆32Updated last year
- Artifacts for our SIGCOMM'23 paper Ditto☆15Updated last year
- AQUATOPE: QoS-and-Uncertainty-Aware Resource Management for Multi-Stage Serverless Workflows (ASPLOS'23)☆23Updated last year
- FaaSFlow: Enable Efficient Workflow Execution for Function-as-a-Service☆73Updated last year
- Artifacts for our NSDI'23 paper TGS☆75Updated 9 months ago
- Serverless optimizations☆50Updated last year
- A benchmark suite for evaluating FaaS scheduler.☆22Updated 2 years ago
- ☆17Updated 2 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆49Updated 2 years ago
- Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Container Sharing☆48Updated 2 years ago
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA).☆60Updated last month
- A universal workflow system for exactly-once DAGs☆23Updated last year
- ☆26Updated 6 months ago
- Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …☆33Updated 10 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆19Updated 3 months ago
- Virtual Memory Abstraction for Serverless Architectures☆46Updated 3 years ago
- FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute (USENIX ATC'21)☆54Updated 3 years ago
- A resilient distributed training framework☆93Updated 11 months ago
- Molecule's artifact for ASPLOS'22☆29Updated 3 years ago
- ☆49Updated 2 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆94Updated 2 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆52Updated 10 months ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆29Updated 4 months ago
- Dirigent: Lightweight Serverless Orchestration☆37Updated 3 months ago
- ☆47Updated 3 months ago
- An interference-aware scheduler for fine-grained GPU sharing☆129Updated 2 months ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆113Updated last year
- Integrated Training Platform (ITP) traces used in ElasticFlow paper.☆29Updated 2 years ago