thu-pacman / lab-guide
Everything about PACMAN!
☆13 Updated last month
Alternatives and similar repositories for lab-guide
Users who are interested in lab-guide are comparing it to the repositories listed below
- A Factored System for Sample-based GNN Training over GPUs ☆42 Updated last year
- ☆36 Updated last year
- ngAP's artifact for ASPLOS'24 ☆24 Updated last month
- Tigon: A Distributed Database for a CXL Pod [OSDI '25] ☆27 Updated last month
- Universal Presentation: A Header-only C++ Library to Cout STL containers and more ☆18 Updated last year
- An efficient concurrent graph processing system ☆46 Updated 3 years ago
- MESMERIC: A Software-based NVM Emulator Supporting Read/Write Asymmetric Latencies ☆10 Updated 4 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆28 Updated 4 years ago
- My paper/code reading notes in Chinese ☆46 Updated last month
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces. ☆53 Updated 11 months ago
- A GPU FP32 computation method with Tensor Cores. ☆21 Updated 2 years ago
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs. ☆49 Updated last year
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… ☆40 Updated last year
- PerFlow-AI is a programmable performance analysis, modeling, and prediction tool for AI systems. ☆21 Updated 2 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling. ☆42 Updated 3 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 Updated last year
- PetPS: Supporting Huge Embedding Models with Tiered Memory ☆32 Updated last year
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs. ☆66 Updated 2 years ago
- Efficient Compute-Communication Overlap for Distributed LLM Inference ☆24 Updated 3 weeks ago
- ☆18 Updated 3 years ago
- MemLiner is a remote-memory-friendly runtime system. ☆31 Updated 2 years ago
- FGNN's artifact evaluation (EuroSys 2022) ☆17 Updated 3 years ago
- Rebuild YatSenOS On RISC-V 64. ☆20 Updated 3 years ago
- A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs ☆55 Updated 4 years ago
- Deduplication over disaggregated memory for Serverless Computing ☆13 Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS ☆29 Updated 5 months ago
- ☆13 Updated 3 years ago
- ☆10 Updated last year
- Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training ☆25 Updated last year
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs ☆73 Updated 2 years ago