kcxain / dlsysLinks

My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022

☆40

Alternatives and similar repositories for dlsys

Users that are interested in dlsys are comparing it to the libraries listed below

Sorting:

Sunt-ing / stick
A PyTorch-like deep learning framework. Just for fun.
☆156Updated last year
YuanchengFang / dlsys_solution
Homework solutions for CMU 10-414/714 – Deep Learning Systems: Algorithms and Implementation
☆45Updated 2 years ago
PKUFlyingPig / CMU10-714
Learning material for CMU10-714: Deep Learning System
☆264Updated last year
MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆263Updated 6 months ago
eedalong / ECE408
Code base and slides for ECE408：Applied Parallel Programming On GPU.
☆128Updated 4 years ago
ZonePG / cs-notes
my cs notes
☆53Updated 9 months ago
interestingLSY / CUDA-From-Correctness-To-Performance-Code
Codes & examples for "CUDA - From Correctness to Performance"
☆103Updated 9 months ago
PKUFlyingPig / CS149-parallel-computing
Learning materials for Stanford CS149 : Parallel Computing
☆231Updated 4 years ago
PKUFlyingPig / MIT6.5940_TinyML
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
☆51Updated 6 months ago
SiriusNEO / Triton-Puzzles-Lite
Puzzles for learning Triton, play it with minimal environment configuration!
☆442Updated 8 months ago
mdy666 / mdy_triton
☆140Updated last month
ysj1173886760 / PyToy
deep learning framework from scratch
☆30Updated 3 years ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆264Updated 4 months ago
guanrenyang / Programming-Massively-Parallel-Processors
Solution of Programming Massively Parallel Processors
☆47Updated last year
zhang-tlgg / HPC-Lab
HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.
☆25Updated 2 years ago
lambda7xx / awesome-AI-system
paper and its code for AI System
☆318Updated 3 months ago
aschuh703 / ECE408
☆47Updated last year
BBuf / how-to-learn-deep-learning-framework
how to learn PyTorch and OneFlow
☆445Updated last year
JackonYang / hands-on-tvm
hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.
☆49Updated 2 years ago
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆220Updated this week
sunkx109 / My-Torch-Extension
A minimalist and extensible PyTorch extension for implementing custom backend operators in PyTorch.
☆33Updated last year
TreeAI-Lab / Awesome-KV-Cache-Management
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…
☆172Updated this week
harleyszhang / llm_counts
llm theoretical performance analysis tools and support params, flops, memory and latency analysis.
☆99Updated 3 weeks ago
chenhongyu2048 / LLM-inference-optimization-paper
Summary of some awesome work for optimizing LLM inference
☆92Updated 2 months ago
XiaoSong9905 / HPC-Notes
Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]
☆69Updated 3 years ago
yifanlu0227 / MIT-6.5940
All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai
☆177Updated last year
ifromeast / cuda_learning
learning how CUDA works
☆291Updated 5 months ago
l1nkr / DL-Compiler-Navigation
Machine Learning Compiler Road Map
☆43Updated last year
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆138Updated this week
66RING / tiny-flash-attention
flash attention tutorial written in python, triton, cuda, cutlass
☆398Updated 2 months ago