wangdh15 / cs149
☆12 · Updated 3 years ago
Alternatives and similar repositories for cs149
Users that are interested in cs149 are comparing it to the libraries listed below
- A scheduling framework for multitasking over diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs ☆154 · Updated 2 weeks ago
- CS149 xmake version ☆46 · Updated 2 years ago
- The code based on vLLM for the paper "Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention". ☆11 · Updated last year
- ngAP's artifact for ASPLOS'24 ☆25 · Updated 6 months ago
- Final lab of the Compiler course at USTC, focusing on MLIR ☆20 · Updated 5 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche… ☆104 · Updated 3 years ago
- My Paper Reading Lists and Notes. ☆21 · Updated 2 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling. ☆44 · Updated 3 years ago
- From Minimal GEMM to Everything ☆98 · Updated last month
- A Program-Behavior-Guided Far Memory System ☆35 · Updated 2 years ago
- Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory" ☆31 · Updated last year
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25] ☆41 · Updated 8 months ago
- ☆37 · Updated last year
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces. ☆58 · Updated last year
- Machine Learning Compiler Road Map ☆46 · Updated 2 years ago
- A toy compiler written in C++17 that translates SysY (a C-like toy language) into ARM-v7a assembly. ☆146 · Updated 4 years ago
- A PyTorch-like deep learning framework. Just for fun. ☆158 · Updated 2 years ago
- ☆79 · Updated 3 years ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory ☆33 · Updated last year
- Learning materials for Stanford CS149: Parallel Computing ☆270 · Updated 4 years ago
- SYSU-ARCH is a lab that focuses on using and extending simulators. ☆11 · Updated 3 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference ☆19 · Updated 2 years ago
- This repo stores a more profound view of Computer Architecture: A Quantitative Approach that tells multi-tenancy, virtualize, fine graine… ☆29 · Updated last month
- Codes & examples for "CUDA - From Correctness to Performance" ☆121 · Updated last year
- Rebuild YatSenOS On RISC-V 64. ☆22 · Updated 4 years ago
- Tigon: A Distributed Database for a CXL Pod [OSDI '25] ☆44 · Updated 2 months ago
- "Write Your Own AI Compiler" (《自己动手写AI编译器》) ☆33 · Updated last year
- ☆47 · Updated 6 months ago
- Compiler development environment. ☆21 · Updated 2 months ago
- Website for Artifact Evaluation at EuroSys, SOSP, OSDI, ATC ☆49 · Updated last week