zhang-tlgg / HPC-LabLinks
HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.
☆24Updated 2 years ago
Alternatives and similar repositories for HPC-Lab
Users that are interested in HPC-Lab are comparing it to the libraries listed below
Sorting:
- Learning materials for Stanford CS149 : Parallel Computing☆267Updated 4 years ago
- ☆143Updated 3 weeks ago
- Codes & examples for "CUDA - From Correctness to Performance"☆120Updated last year
- Repository for HPCGame 1st Problems.☆69Updated last year
- ☆281Updated 2 months ago
- Summary of some awesome work for optimizing LLM inference☆162Updated last month
- This repository is established to store personal notes and annotated papers during daily research.☆173Updated last week
- Documentation for HPC course☆159Updated 7 months ago
- ☆48Updated 2 years ago
- Homework solutions for CMU 10-414/714 – Deep Learning Systems: Algorithms and Implementation☆46Updated 3 years ago
- paper and its code for AI System☆340Updated last month
- Solution of Programming Massively Parallel Processors☆49Updated last year
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆311Updated last year
- ☆79Updated 3 years ago
- Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…☆146Updated this week
- A Throughput-Optimized Pipeline Parallel Inference System for Large Language Models☆47Updated 2 weeks ago
- A PyTorch-like deep learning framework. Just for fun.☆157Updated 2 years ago
- ☆74Updated 2 months ago
- Flash Attention from Scratch on CUDA Ampere☆115Updated 4 months ago
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆142Updated 4 years ago
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…☆282Updated 10 months ago
- ☆54Updated 3 months ago
- Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.☆51Updated 2 months ago
- From Minimal GEMM to Everything☆95Updated last week
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆105Updated 3 years ago
- ☆15Updated last year
- ☆14Updated this week
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆19Updated 5 months ago
- OpenCAEPoro for ASC 2024☆37Updated 2 years ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Updated last year