chhzh123 / Krill
An efficient concurrent graph processing system
☆46 Updated 3 years ago
Alternatives and similar repositories for Krill
Users that are interested in Krill are comparing it to the libraries listed below
- My paper/code reading notes in Chinese ☆46 Updated 2 months ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph ☆55 Updated 3 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling. ☆42 Updated 3 years ago
- ☆36 Updated last year
- ngAP's artifact for ASPLOS'24 ☆24 Updated 3 weeks ago
- GVProf: A Value Profiler for GPU-based Clusters ☆51 Updated last year
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference ☆19 Updated 2 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS ☆31 Updated 6 months ago
- Vector search with bounded performance. ☆36 Updated last year
- Rebuild YatSenOS On RISC-V 64. ☆20 Updated 3 years ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs ☆74 Updated 2 years ago
- Dorylus: Affordable, Scalable, and Accurate GNN Training ☆76 Updated 4 years ago
- Graph Pattern Mining ☆90 Updated 11 months ago
- A Factored System for Sample-based GNN Training over GPUs ☆42 Updated 2 years ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs. ☆66 Updated 2 years ago
- Papers on Graph Analytics, Mining, and Learning ☆127 Updated 3 years ago
- MemLiner is a remote-memory-friendly runtime system. ☆31 Updated 2 years ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling ☆13 Updated last year
- Tigon: A Distributed Database for a CXL Pod [OSDI '25] ☆28 Updated 2 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 Updated 6 years ago
- A Framework for Graph Sampling and Random Walk on GPUs. ☆38 Updated 6 months ago
- ☆70 Updated 4 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces. ☆53 Updated last year
- FlashMob is a shared-memory random walk system. ☆32 Updated 2 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training ☆35 Updated 2 years ago
- ☆30 Updated last year
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs. ☆49 Updated last year
- Out-of-GPU-Memory Graph Processing with Minimal Data Transfer ☆55 Updated 2 years ago
- ☆24 Updated 3 years ago
- ETHZ Heterogeneous Accelerated Compute Cluster. ☆36 Updated 4 months ago