This is a list of awesome edgeAI inference related papers.
☆99Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for awesome-real-time-AI
Users that are interested in awesome-real-time-AI are comparing it to the libraries listed below
Sorting:
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- 在RISC-V处理器上实现一个轻量级的Hypervisor。☆12Dec 25, 2020Updated 5 years ago
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 2 years ago
- ☆41Apr 25, 2024Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆143Mar 31, 2023Updated 2 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆362Jul 30, 2024Updated last year
- ☆16Oct 3, 2023Updated 2 years ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆33Jan 4, 2022Updated 4 years ago
- ☆15Jul 25, 2023Updated 2 years ago
- DietCode Code Release☆65Jul 21, 2022Updated 3 years ago
- Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)☆18Apr 13, 2023Updated 2 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- ☆212Jan 17, 2024Updated 2 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆121Oct 26, 2022Updated 3 years ago
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs☆59May 21, 2023Updated 2 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- Torch Frontend for IREE☆25Dec 21, 2023Updated 2 years ago
- Repo for transient training paper at ICAC 2019.☆11Oct 5, 2022Updated 3 years ago
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- ☆42Nov 5, 2023Updated 2 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆48Feb 13, 2026Updated 3 weeks ago
- ☆38Jun 27, 2025Updated 8 months ago
- ☆27Oct 25, 2023Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆199Apr 27, 2022Updated 3 years ago
- 微信Ipad协议golang版本,基于grpc的实现策略。这套代码需要通过gprc服务端组包解包才可以正常使用☆12Jul 8, 2019Updated 6 years ago
- ☆12Nov 8, 2024Updated last year
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- An interference-aware scheduler for fine-grained GPU sharing☆160Nov 26, 2025Updated 3 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆43May 29, 2022Updated 3 years ago
- A collection of papers on LLM applications in the IoT field.☆18Jan 21, 2026Updated last month
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated last year
- Generative Models for Image Captioning☆10Jun 7, 2017Updated 8 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- PLCT实验室2019年开放日资料(OpenDay-2019)☆11Dec 20, 2019Updated 6 years ago
- OSDI 2023 Welder, deeplearning compiler☆32Nov 24, 2023Updated 2 years ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆28May 10, 2021Updated 4 years ago
- ☆166Jul 22, 2024Updated last year