chhzh123 / KrillLinks
An efficient concurrent graph processing system
☆46Updated 3 years ago
Alternatives and similar repositories for Krill
Users that are interested in Krill are comparing it to the libraries listed below
Sorting:
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆25Updated 3 months ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆54Updated 2 years ago
- GVProf: A Value Profiler for GPU-based Clusters☆49Updated last year
- ☆16Updated 2 years ago
- ☆70Updated 3 years ago
- My paper/code reading notes in Chinese☆46Updated last year
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆18Updated 2 years ago
- Graph Sampling using GPU☆52Updated 3 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆42Updated 3 years ago
- ☆22Updated 2 years ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.☆66Updated 2 years ago
- Vector search with bounded performance.☆35Updated last year
- A Framework for Graph Sampling and Random Walk on GPUs.☆39Updated 3 months ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆52Updated 9 months ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆38Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- ☆43Updated last year
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆73Updated 2 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆121Updated 2 years ago
- FGNN's artifact evaluation (EuroSys 2022)☆17Updated 3 years ago
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆76Updated 4 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆40Updated last year
- ☆79Updated 2 years ago
- Out-of-GPU-Memory Graph Processing with Minimal Data Transfer☆53Updated 2 years ago
- ☆30Updated 2 years ago
- Seminar on selected tools in Computer Science☆25Updated 4 years ago
- ngAP's artifact for ASPLOS'24☆23Updated 4 months ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆31Updated last year