MachineLearningSystem/25ASPLOS-Medusa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MachineLearningSystem/25ASPLOS-Medusa)

MachineLearningSystem / 25ASPLOS-Medusa

Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]

☆12

Alternatives and similar repositories for 25ASPLOS-Medusa

Users that are interested in 25ASPLOS-Medusa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thustorage / Medusa
View on GitHub
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆47May 13, 2025Updated last year
microsoft / elasticflow-traces
View on GitHub
Integrated Training Platform (ITP) traces used in ElasticFlow paper.
☆31Dec 23, 2022Updated 3 years ago
pkusys / ElasticFlow
View on GitHub
Artifacts for our ASPLOS'23 paper ElasticFlow
☆56May 10, 2024Updated 2 years ago
appl-lab / CuTS
View on GitHub
☆13Sep 8, 2021Updated 4 years ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11Dec 30, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zijwang / talkdown
View on GitHub
Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."
☆10Jan 26, 2020Updated 6 years ago
wanghongfei / hello
View on GitHub
Anonymous Chatting Website implemented by WebSocket(匿名在线聊天交友网站)
☆16Sep 17, 2015Updated 10 years ago
kygx-legend / vsgm
View on GitHub
☆11Nov 14, 2023Updated 2 years ago
13767004362 / SQLitePractice
View on GitHub
数据库案例：1.使用时间和日期函数，增，查时间字段。2.利用ContentProvider,CursorLoader,SQLite实现数据库的观察者模式。3.RxJava,SQLBrite实现数据库的观察者模式。4.拷贝外部db文件到数据库中
☆21May 11, 2017Updated 9 years ago
oska874 / process-scheduling-in-linux
View on GitHub
本文译自 University of Edinburgh 的 Volker Seeker 的 Process Scheduling in Linux ，介绍了 Linux 3.1 的任务调度机制。
☆11Aug 11, 2016Updated 9 years ago
csl-iisc / Trident-MICRO21-artifact
View on GitHub
☆10Sep 15, 2023Updated 2 years ago
LPD-EPFL / swarm-kv
View on GitHub
A fault-tolerant RDMA-based disaggregated key-value store with 1-RTT UPDATEs and GETs thanks to the SWARM replication protocol
☆14Sep 25, 2024Updated last year
fansy1990 / movie_recommend
View on GitHub
☆21Aug 27, 2016Updated 9 years ago
interestingLSY / swiftLLM
View on GitHub
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …
☆329Jun 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
utnslab / Medes
View on GitHub
Deduplication over dis-aggregated memory for Serverless Computing
☆14Mar 21, 2022Updated 4 years ago
GTkernel / coeus-sim
View on GitHub
☆12Mar 22, 2022Updated 4 years ago
Monaco12138 / sr
View on GitHub
☆10Sep 14, 2023Updated 2 years ago
Codewarnings / Six-in-a-row
View on GitHub
该六子棋程序使用Java语言编写,内置AI落子,主要由阿尔法贝塔搜索+评估函数实现,存在一定的bug,智能方面还行吧
☆13Jul 24, 2021Updated 4 years ago
oscarlab / graphene-sgx-driver
View on GitHub
Linux kernel SGX driver for Graphene
☆12Nov 3, 2020Updated 5 years ago
DiscreteTom / dt-blog-boilerplate
View on GitHub
DiscreteTom's Blog Boilerplate.
☆10Mar 6, 2023Updated 3 years ago
ZhangJiaQiao / 2020-DBMS-project
View on GitHub
This is the final project of 2020 DBMS course in SYSU
☆10Jun 23, 2020Updated 6 years ago
vuhpdc / jellyfish
View on GitHub
Source code for Jellyfish, a soft real-time inference serving system
☆15Dec 20, 2022Updated 3 years ago
vineeths96 / Gradient-Compression
View on GitHub
We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…
☆10Nov 14, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
acsl-technion / TPT
View on GitHub
☆15May 23, 2023Updated 3 years ago
hickory-dns / resolv-conf
View on GitHub
The /etc/resolv.conf file parser in rust
☆35Dec 29, 2025Updated 6 months ago
ningliu-iga / DAFNO
View on GitHub
Domain Agnostic Fourier Neural Operators (DAFNO)
☆20Sep 3, 2024Updated last year
yuanxinnn / APTMoE
View on GitHub
☆13Jun 29, 2024Updated 2 years ago
aisoft9 / JYCache
View on GitHub
DRAM/SSD hybrid caching system
☆15Mar 13, 2025Updated last year
wudu98 / autoGEMM
View on GitHub
☆15Dec 5, 2024Updated last year
hiddenlayer2020 / ML-Job-Scheduler-MLFS
View on GitHub
☆13Dec 18, 2020Updated 5 years ago
acsl-technion / hyperturtle
View on GitHub
Meta repository for the USENIX ATC'25 Publication "Accelerating Nested Virtualization with HyperTurtle"
☆17Mar 22, 2026Updated 3 months ago
nimish15shah / DAG_Processor
View on GitHub
A DAG processor and compiler for a tree-based spatial datapath.
☆16Aug 24, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mururu / rafute
View on GitHub
An implementation of Raft Consensus Algorithm in Elixir
☆21Apr 3, 2016Updated 10 years ago
yafuly / PromptNMT
View on GitHub
☆15Dec 2, 2022Updated 3 years ago
Sys-KU / LMServe
View on GitHub
A lightweight and fast LLM serving framework
☆15Mar 5, 2026Updated 4 months ago
gudiandian / ElasticFlow
View on GitHub
☆17May 10, 2024Updated 2 years ago
wu-kan / wuk_cupti_wrapper
View on GitHub
a simple API to use CUPTI
☆10Aug 19, 2025Updated 11 months ago
yijieZ / UniMem
View on GitHub
☆11Jun 5, 2024Updated 2 years ago
acryl-aaai / perf
View on GitHub
☆15Dec 13, 2024Updated last year