wangdh15 / cs149
☆12Updated 3 years ago
Alternatives and similar repositories for cs149
Users that are interested in cs149 are comparing it to the libraries listed below
- A scheduling framework for multitasking over diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs☆120Updated 3 weeks ago
- CS149 xmake version☆42Updated last year
- Advanced Computer Architecture 2020, taught by Wu Junmin, a USTC graduate course☆73Updated last year
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆18Updated 2 months ago
- A toy compiler written in C++17 that translates SysY (a C-like toy language) into ARM-v7a assembly.☆144Updated 4 years ago
- Code based on vLLM for the paper “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Updated last year
- ☆259Updated last week
- Final lab of the Compiler course at USTC, focusing on MLIR☆19Updated 4 years ago
- Learning materials for Stanford CS149 : Parallel Computing☆245Updated 4 years ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Updated 2 months ago
- ngAP's artifact for ASPLOS'24☆24Updated 2 months ago
- Codes & examples for "CUDA - From Correctness to Performance"☆114Updated 11 months ago
- Documentation for YatCPU☆53Updated last year
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆101Updated 2 years ago
- ☆42Updated 3 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆43Updated 3 years ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆59Updated last week
- A PyTorch-like deep learning framework. Just for fun.☆156Updated 2 years ago
- My Paper Reading Lists and Notes.☆20Updated last week
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆74Updated 3 years ago
- ChocoPy LLVM Repo☆77Updated 2 years ago
- This repo stores a more profound view of Computer Architecture: A Quantitative Approach that tells multi-tenancy, virtualize, fine graine…☆26Updated last year
- Machine Learning Compiler Road Map☆44Updated 2 years ago
- "Write Your Own AI Compiler" (《自己动手写AI编译器》)☆28Updated last year
- ☆23Updated last year
- From Minimal GEMM to Everything☆55Updated last week
- An optimizing compiler targeting armv7 and risc-v32☆62Updated 8 months ago
- Horizontal Fusion☆24Updated 3 years ago
- ☆40Updated 2 years ago
- SYSU-ARCH is a lab that focuses on using and extending simulators.☆11Updated 2 years ago