yifanlu0227 / MIT-6.5940

All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai

☆156

Alternatives and similar repositories for MIT-6.5940:

Users who are interested in MIT-6.5940 are comparing it to the repositories listed below.

PKUFlyingPig / CMU10-714
Learning material for CMU10-714: Deep Learning System
☆233 Updated 9 months ago
mit-han-lab / parallel-computing-tutorial
☆156 Updated last year
yifanlu0227 / LLaMA2-7B-on-laptop
Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.
☆14 Updated last year
SiriusNEO / Triton-Puzzles-Lite
Puzzles for learning Triton, play it with minimal environment configuration!
☆225 Updated 2 months ago
October2001 / Awesome-KV-Cache-Compression
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
☆296 Updated 2 weeks ago
Zhen-Dong / Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
☆531 Updated 2 months ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆220 Updated last month
YuanchengFang / dlsys_solution
Homework solutions for CMU 10-414/714 – Deep Learning Systems: Algorithms and Implementation
☆43 Updated 2 years ago
Zefan-Cai / Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
☆206 Updated 2 months ago
goliaro / specinfer-ae
☆18 Updated 11 months ago
Sunt-ing / stick
A PyTorch-like deep learning framework. Just for fun.
☆143 Updated last year
66RING / tiny-flash-attention
flash attention tutorial written in python, triton, cuda, cutlass
☆260 Updated last month
MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆177 Updated last month
SNU-ARC / any-precision-llm
[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs
☆96 Updated last month
ifromeast / cuda_learning
learning how CUDA works
☆200 Updated 6 months ago
harleyszhang / llm_counts
LLM theoretical performance analysis tools, supporting params, FLOPs, memory, and latency analysis.
☆78 Updated last month
KnowingNothing / MatmulTutorial
An Easy-to-understand TensorOp Matmul Tutorial
☆316 Updated 4 months ago
Shenggan / awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
☆214 Updated 4 months ago
JackonYang / hands-on-tvm
Hands-on model tuning with TVM, profiled on a Mac M1, an x86 CPU, and a GTX-1080 GPU.
☆45 Updated last year
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆109 Updated this week
hemingkx / SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
☆595 Updated this week
byungsoo-oh / ml-systems-papers
Curated collection of papers in machine learning systems
☆234 Updated this week
FMInference / DejaVu
☆314 Updated 10 months ago
pprp / Awesome-LLM-Prune
Awesome list for LLM pruning.
☆202 Updated 2 months ago
lambda7xx / awesome-AI-system
Papers and their code for AI systems
☆272 Updated 3 weeks ago
eedalong / ECE408
Code base and slides for ECE408: Applied Parallel Programming On GPU.
☆120 Updated 3 years ago
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆73 Updated 8 months ago
thu-nics / qllm-eval
Code Repository of Evaluating Quantized Large Language Models
☆116 Updated 5 months ago
hahnyuan / LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…
☆391 Updated 5 months ago
liyunqianggyn / Awesome-LLMs-Pruning
Awesome LLM pruning papers: an all-in-one repository integrating all useful resources and insights.
☆70 Updated 2 months ago