Kyrie-Zhao/awesome-real-time-AI

This is a list of awesome edge AI inference-related papers.

☆98

Alternatives and similar repositories for awesome-real-time-AI

Users who are interested in awesome-real-time-AI are comparing it to the libraries listed below.

SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆58 · Aug 21, 2024 · Updated last year
caoting-dotcom / multiBranchModel
Multi-branch model for concurrent execution
☆18 · Jun 27, 2023 · Updated 3 years ago
UbiquitousLearning / Paper-list-resource-efficient-large-language-model
☆103 · Jan 17, 2024 · Updated 2 years ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆108 · Dec 24, 2022 · Updated 3 years ago
130B848 / ipads-tutorial07
☆10 · Dec 8, 2021 · Updated 4 years ago
UofT-EcoSystem / DietCode
DietCode Code Release
☆65 · Jul 21, 2022 · Updated 3 years ago
JonnyKong / AccuMO
[MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality
☆17 · Oct 8, 2023 · Updated 2 years ago
S-Lab-System-Group / Primo
Primo: Practical Learning-Augmented Systems with Interpretable Models
☆19 · Dec 26, 2023 · Updated 2 years ago
pittisl / ElasticTrainer
Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)
☆14 · Nov 1, 2023 · Updated 2 years ago
zhaiyi000 / tlp
☆42 · Apr 25, 2024 · Updated 2 years ago
swagshaw / Awesome-Cloud-Edge-AI
A curated list of research in System for Edge Intelligence and Computing (Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…
☆33 · Jan 4, 2022 · Updated 4 years ago
limenghao / AdaTune
This is the implementation for the paper "AdaTune: Adaptive Tensor Program Compilation Made Efficient" (NeurIPS 2020).
☆14 · May 16, 2021 · Updated 5 years ago
wenh18 / AdaptiveNet
☆16 · Oct 3, 2023 · Updated 2 years ago
pittisl / AgileNN
Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)
☆18 · Apr 13, 2023 · Updated 3 years ago
ysyisyourbrother / awesome-on-device-AI
A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…
☆53 · Apr 29, 2026 · Updated 2 months ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆163 · Nov 26, 2025 · Updated 7 months ago
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43 · May 29, 2022 · Updated 4 years ago
Raphael-Hao / Abacus
☆38 · Jun 27, 2025 · Updated last year
xumengwei / Edge-AI-Paper-List
☆215 · Jan 17, 2024 · Updated 2 years ago
UbiquitousLearning / MobileFM
One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…
☆30 · Mar 5, 2024 · Updated 2 years ago
yuzehh / VI-Map
☆28 · Oct 25, 2023 · Updated 2 years ago
zhuzilin / pytorch-malloc
An external memory allocator example for PyTorch.
☆16 · Aug 10, 2025 · Updated 11 months ago
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145 · Mar 31, 2023 · Updated 3 years ago
TiledTensor / TiledKernel
TiledKernel is a code generation library based on macro kernels and a memory hierarchy graph data structure.
☆19 · May 12, 2024 · Updated 2 years ago
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆45 · Nov 13, 2023 · Updated 2 years ago
iree-org / iree-torch
Torch Frontend for IREE
☆26 · Dec 21, 2023 · Updated 2 years ago
microsoft / Moonlit
This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.
☆88 · Oct 25, 2024 · Updated last year
utnslab / Medes
Deduplication over disaggregated memory for Serverless Computing
☆14 · Mar 21, 2022 · Updated 4 years ago
microsoft / nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
☆364 · Jul 30, 2024 · Updated last year
aFuerst / faascache-sim
☆18 · Oct 31, 2022 · Updated 3 years ago
oscomp / proj23-lightweight-hypervisor
Implement a lightweight hypervisor on a RISC-V processor.
☆12 · Dec 25, 2020 · Updated 5 years ago
yuanmu97 / PacketGame
[SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
☆15 · Jul 1, 2023 · Updated 3 years ago
jack-willturner / nas-as-program-transformation-exploration
The code for our paper "Neural Architecture Search as Program Transformation Exploration"
☆17 · Apr 28, 2021 · Updated 5 years ago
S-Lab-System-Group / Lucid
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆61 · May 21, 2023 · Updated 3 years ago
IntelliSys-Lab / RainbowCake-ASPLOS24
☆42 · Nov 5, 2023 · Updated 2 years ago
WUSTL-CSPL / Kairos-Userspace
☆24 · Jul 8, 2024 · Updated 2 years ago
S-Lab-System-Group / Awesome-ML-for-System
SOTA Learning-augmented Systems
☆37 · May 21, 2022 · Updated 4 years ago
gudiandian / ElasticFlow
☆17 · May 10, 2024 · Updated 2 years ago
LLMServe / dLoRA-artifact
☆32 · May 28, 2024 · Updated 2 years ago