Kyrie-Zhao / awesome-real-time-AI
A list of awesome papers related to edge AI inference.
☆99Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for awesome-real-time-AI
Users that are interested in awesome-real-time-AI are comparing it to the libraries listed below
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Oct 8, 2023Updated 2 years ago
- Primo: Practical Learning-Augmented Systems with Interpretable Models☆19Dec 26, 2023Updated 2 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆104Dec 24, 2022Updated 3 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- Implements a lightweight hypervisor on a RISC-V processor.☆12Dec 25, 2020Updated 5 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- ☆41Apr 25, 2024Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆142Mar 31, 2023Updated 2 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆29Mar 5, 2024Updated last year
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆363Jul 30, 2024Updated last year
- ☆16Oct 3, 2023Updated 2 years ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆33Jan 4, 2022Updated 4 years ago
- Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)☆18Apr 13, 2023Updated 2 years ago
- DietCode Code Release☆65Jul 21, 2022Updated 3 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Apr 28, 2021Updated 4 years ago
- ☆15Jul 25, 2023Updated 2 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- ☆213Jan 17, 2024Updated 2 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆121Oct 26, 2022Updated 3 years ago
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs☆58May 21, 2023Updated 2 years ago
- Torch Frontend for IREE☆25Dec 21, 2023Updated 2 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- ☆10Jun 18, 2024Updated last year
- Repo for transient training paper at ICAC 2019.☆11Oct 5, 2022Updated 3 years ago
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- ☆42Nov 5, 2023Updated 2 years ago
- ☆38Jun 27, 2025Updated 7 months ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆47Updated this week
- ☆27Oct 25, 2023Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆199Apr 27, 2022Updated 3 years ago
- A collection of papers on LLM applications in the IoT field.☆18Jan 21, 2026Updated 3 weeks ago
- Golang version of the WeChat iPad protocol, implemented on top of gRPC. This code needs a gRPC server to pack and unpack packets before it works properly.☆12Jul 8, 2019Updated 6 years ago
- ☆12Nov 8, 2024Updated last year
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆47Apr 4, 2022Updated 3 years ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- Generative Models for Image Captioning☆10Jun 7, 2017Updated 8 years ago