TankLabTJU / INFlessLinks

The source code of INFless，a native serverless platform for AI inference.

☆44

Alternatives and similar repositories for INFless

Users that are interested in INFless are comparing it to the libraries listed below

Sorting:

IntelliSys-Lab / RainbowCake-ASPLOS24
☆40Updated 2 years ago
All-less / faas-scheduling-benchmark
A benchmark suite for evaluating FaaS scheduler.
☆23Updated 3 years ago
msr-fiddle / synergy
☆51Updated 2 years ago
icanforce / Orion-OSDI22
Serverless optimizations
☆51Updated last year
JelixLi / Tetris
☆18Updated 2 years ago
aFuerst / faascache-sim
☆18Updated 3 years ago
lzjzx1122 / Pagurus
Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Container Sharing
☆49Updated 3 years ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆55Updated last year
S-Lab-System-Group / Lucid
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆58Updated 2 years ago
lzjzx1122 / FaaSFlow
FaaSFlow: Enable Efficient Workflow Execution for Function-as-a-Service
☆79Updated last year
msr-fiddle / blox
☆44Updated last year
S-Lab-System-Group / HeliosData
Helios Traces from SenseTime
☆62Updated 3 years ago
pkusys / TGS
Artifacts for our NSDI'23 paper TGS
☆90Updated last year
MincYu / pheromone
☆45Updated 3 years ago
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆73Updated last month
msr-fiddle / philly-traces
☆198Updated 6 years ago
Rivendile / Muri
Artifacts for our SIGCOMM'22 paper Muri
☆44Updated last year
James-QiuHaoran / LLM-serving-with-proxy-models
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …
☆49Updated last year
MincYu / gillis-open-source
☆26Updated 2 years ago
Raphael-Hao / Abacus
☆38Updated 5 months ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆153Updated last week
Molecule-Serverless / molecule-artifact
Molecule's artifact for ASPLOS'22
☆29Updated 3 years ago
zzhou612 / aquatope
AQUATOPE: QoS-and-Uncertainty-Aware Resource Management for Multi-Stage Serverless Workflows (ASPLOS'23)
☆23Updated last year
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆54Updated 2 years ago
casys-kaist / glet
☆53Updated 11 months ago
siasosp23 / artifacts
☆24Updated 2 years ago
thustorage / Medusa
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆40Updated 6 months ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆103Updated 2 years ago
romilbhardwaj / cilantro
Source code for OSDI 2023 paper titled "Cilantro - Performance-Aware Resource Allocation for General Objectives via Online Feedback"
☆40Updated 2 years ago
gudiandian / ElasticFlow
☆16Updated last year