PKUFlyingPig / MIT6.5940_TinyMLLinks

Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing

☆60

Alternatives and similar repositories for MIT6.5940_TinyML

Users that are interested in MIT6.5940_TinyML are comparing it to the libraries listed below

Sorting:

MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆292Updated 9 months ago
LDLINGLINGLING / nano_vllm_note
注释的nano_vllm仓库，并且完成了MiniCPM4的适配以及注册新模型的功能
☆81Updated 2 months ago
chenhongyu2048 / LLM-inference-optimization-paper
Summary of some awesome work for optimizing LLM inference
☆120Updated 4 months ago
mdy666 / mdy_triton
☆148Updated 3 months ago
interestingLSY / CUDA-From-Correctness-To-Performance-Code
Codes & examples for "CUDA - From Correctness to Performance"
☆114Updated last year
PKUFlyingPig / CMU10-714
Learning material for CMU10-714: Deep Learning System
☆279Updated last year
TreeAI-Lab / Awesome-KV-Cache-Management
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…
☆221Updated 2 months ago
Zefan-Cai / Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
☆376Updated 7 months ago
harleyszhang / llm_counts
llm theoretical performance analysis tools and support params, flops, memory and latency analysis.
☆108Updated 3 months ago
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆285Updated last month
interestingLSY / swiftLLM
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …
☆278Updated 4 months ago
sunkx109 / GPUs-Specs
Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM
☆63Updated 2 months ago
shishishu / LLM-Inference-Acceleration
LLM Inference with Deep Learning Accelerator.
☆52Updated 9 months ago
sihyeong / Awesome-LLM-Inference-Engine
☆138Updated 4 months ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆278Updated 7 months ago
Sunt-ing / stick
A PyTorch-like deep learning framework. Just for fun.
☆156Updated 2 years ago
PKU-SEC-Lab / HybriMoE
[DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"
☆75Updated 4 months ago
yifanlu0227 / MIT-6.5940
All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai
☆181Updated last year
ifromeast / AI_analysis
analyse problems of AI with Math and Code
☆26Updated 2 months ago
ZonePG / cs-notes
my cs notes
☆56Updated last year
hao-ai-lab / vllm-ltr
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
☆60Updated 11 months ago
liangyuwang / Tiny-Megatron
Tiny-Megatron, a minimalistic re-implementation of the Megatron library
☆16Updated last month
PKU-SEC-Lab / AdapMoE
Code release for AdapMoE accepted by ICCAD 2024
☆34Updated 5 months ago
gty111 / gLLM
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling
☆42Updated 3 weeks ago
fanlai0990 / CS598
Systems for GenAI
☆144Updated 6 months ago
HarryWu99 / llm_kvcache_sparsity
Implement some method of LLM KV Cache Sparsity
☆39Updated last year
smart-lty / ParallelSpeculativeDecoding
[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length
☆120Updated 6 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆77Updated 3 years ago
DeepLink-org / DLSlime
DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit
☆70Updated this week
YaoJiayi / CacheBlend
☆141Updated 3 months ago