WalterBabyRudin / Courseware
☆11 Updated 4 years ago
Alternatives and similar repositories for Courseware
Users that are interested in Courseware are comparing it to the libraries listed below
- SOTA Learning-augmented Systems ☆36 Updated 3 years ago
- ☆73 Updated 3 years ago
- system paper reading notes ☆245 Updated 3 years ago
- A PyTorch-like deep learning framework. Just for fun. ☆154 Updated last year
- Code base and slides for ECE408: Applied Parallel Programming On GPU. ☆124 Updated 3 years ago
- Primo: Practical Learning-Augmented Systems with Interpretable Models ☆19 Updated last year
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading ☆36 Updated 3 months ago
- ☆24 Updated 11 months ago
- Learning material for CMU10-714: Deep Learning System ☆251 Updated last year
- Paper-reading notes for Berkeley OS prelim exam. ☆11 Updated 9 months ago
- ☆37 Updated 7 months ago
- Codes & examples for "CUDA - From Correctness to Performance" ☆98 Updated 7 months ago
- ☆18 Updated last year
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters. ☆25 Updated last year
- ☆20 Updated 3 years ago
- [ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo ☆24 Updated last month
- deep learning framework from scratch ☆28 Updated 3 years ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022 ☆37 Updated last year
- Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an … ☆35 Updated last year
- My paper/code reading notes in Chinese ☆46 Updated last year
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25] ☆23 Updated 3 weeks ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems ☆174 Updated 7 months ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23) ☆81 Updated last year
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content] ☆67 Updated 2 years ago
- ☆37 Updated 5 months ago
- Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training ☆24 Updated last year
- Personal blog + reading notes on system-ish papers ☆15 Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T… ☆67 Updated 4 years ago
- CS294/194-196 Large Language Model Agents ☆12 Updated 3 months ago
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference ☆52 Updated 4 months ago