jeho-lee / Awesome-On-Device-AI-SystemsLinks

☆93

Alternatives and similar repositories for Awesome-On-Device-AI-Systems

Users that are interested in Awesome-On-Device-AI-Systems are comparing it to the libraries listed below

Sorting:

xumengwei / Edge-AI-Paper-List
☆211Updated last year
Kyrie-Zhao / awesome-real-time-AI
This is a list of awesome edgeAI inference related papers.
☆98Updated last year
csu-eis / CoDL
☆78Updated 2 years ago
ztt-21 / zTT
zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation
☆26Updated 4 years ago
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆305Updated last month
chenhongyu2048 / LLM-inference-optimization-paper
Summary of some awesome work for optimizing LLM inference
☆138Updated 2 weeks ago
mrsnu / band
Multi-DNN Inference Engine for Heterogeneous Mobile Processors
☆35Updated last year
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆96Updated last year
infinigence / SpecEE
Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)
☆67Updated 6 months ago
goliaro / specinfer-ae
☆23Updated last year
yifanlu0227 / MIT-6.5940
All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai
☆184Updated last year
xxxxyu / FlexNN
Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"
☆56Updated 10 months ago
harleyszhang / llm_counts
llm theoretical performance analysis tools and support params, flops, memory and latency analysis.
☆112Updated 4 months ago
SNU-ARC / any-precision-llm
[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs
☆120Updated 4 months ago
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆161Updated last week
hahnyuan / LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…
☆578Updated last year
casys-kaist / LLMServingSim
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
☆158Updated 4 months ago
UbiquitousLearning / Paper-list-resource-efficient-large-language-model
☆101Updated last year
ysyisyourbrother / awesome-on-device-AI
A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…
☆44Updated 2 years ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆279Updated 8 months ago
ARM-software / kleidiai
This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai
☆95Updated this week
fredrickang / LaLaRAND
LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks
☆15Updated 3 years ago
mental2008 / awesome-papers
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…
☆134Updated 2 weeks ago
yifanlu0227 / LLaMA2-7B-on-laptop
Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.
☆18Updated last year
Zhen-Dong / Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
☆755Updated 7 months ago
pprp / Awesome-LLM-Quantization
Awesome list for LLM quantization
☆353Updated last month
PrincetonUniversity / LLMCompass
☆205Updated 3 weeks ago
UbiquitousLearning / Mandheling-DSP-Training
The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]
☆19Updated 3 years ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆120Updated last year
mit-han-lab / parallel-computing-tutorial
☆176Updated 2 years ago