jiadong5 / ECE408_FA23_UIUC
My GitHub Repo for UIUC ECE408 Applied Parallel Programming, mainly focus on CUDA programming and algorithm implementation.
☆14Updated last year
Alternatives and similar repositories for ECE408_FA23_UIUC:
Users that are interested in ECE408_FA23_UIUC are comparing it to the libraries listed below
- Learning material for CMU10-714: Deep Learning System☆233Updated 9 months ago
- Learning materials for Stanford CS149 : Parallel Computing☆202Updated 3 years ago
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆156Updated last year
- ☆48Updated last year
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆10Updated last year
- Stanford CS149 -- Assignment 1☆87Updated 4 months ago
- Codes & examples for "CUDA - From Correctness to Performance"☆80Updated 3 months ago
- ☆124Updated 6 months ago
- Solution of Programming Massively Parallel Processors☆40Updated last year
- A PyTorch-like deep learning framework. Just for fun.☆142Updated last year
- Xiao's CUDA Optimization Guide [Active Adding New Contents]☆264Updated 2 years ago
- Homework solutions for CMU 10-414/714 – Deep Learning Systems: Algorithms and Implementation☆43Updated 2 years ago
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆61Updated 2 years ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆13Updated last year
- ☆15Updated this week
- my implementation for the CS61C labs in 2020 summer version☆71Updated 4 years ago
- ☆156Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆64Updated 4 years ago
- Systems for GenAI☆102Updated this week
- CS149 xmake version☆43Updated last year
- IMPACT GPU Algorithms Teaching Labs☆56Updated last year
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆120Updated 3 years ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆179Updated last month
- A Easy-to-understand TensorOp Matmul Tutorial☆316Updated 5 months ago
- 🎉CUDA 笔记 / 高频面试题汇总 / C++笔记,个人笔记,更新随缘: sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.☆20Updated 11 months ago
- ☆22Updated this week
- This repo contains the Assignments from Cornell Tech's ECE 5545 - Machine Learning Hardware and Systems offered in Spring 2023☆27Updated last year
- The pintos source distribution for PKU Operating System Course projects☆46Updated this week
- Examples of CUDA implementations by Cutlass CuTe☆138Updated 2 weeks ago
- UC Berkeley signal and system course☆62Updated 4 years ago