thu-pacman / lab-guideLinks
Everything about PACMAN!
☆14Updated this week
Alternatives and similar repositories for lab-guide
Users that are interested in lab-guide are comparing it to the libraries listed below
Sorting:
- My paper/code reading notes in Chinese☆46Updated 6 months ago
- ngAP's artifact for ASPLOS'24☆24Updated 4 months ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆59Updated 3 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆57Updated last year
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.☆52Updated 2 years ago
- Rebuild YatSenOS On RISC-V 64.☆22Updated 3 years ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.☆69Updated 2 years ago
- PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.☆25Updated 2 weeks ago
- A Factored System for Sample-based GNN Training over GPUs☆44Updated 2 years ago
- FGNN's artifact evaluation (EuroSys 2022)☆17Updated 3 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆41Updated last year
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension)☆17Updated 11 months ago
- ☆28Updated last year
- ☆36Updated last year
- A hybrid partitioner based quantum circuit simulation system on GPU☆48Updated 3 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆44Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆32Updated 10 months ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆30Updated 3 years ago
- My Paper Reading Lists and Notes.☆21Updated 3 weeks ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆18Updated 2 years ago
- Horizontal Fusion☆24Updated 3 years ago
- ☆14Updated last month
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆55Updated last year
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated last year
- An efficient concurrent graph processing system☆46Updated 4 years ago
- ☆23Updated 2 years ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆77Updated last month
- Exploring CXL on QEMU Emulation☆30Updated 9 months ago
- A GPU FP32 computation method with Tensor Cores.☆23Updated 2 weeks ago
- Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training☆24Updated last year