AIS-SNU / PathWeaverLinks
A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search
☆20Updated 4 months ago
Alternatives and similar repositories for PathWeaver
Users that are interested in PathWeaver are comparing it to the libraries listed below
Sorting:
- ☆19Updated 6 months ago
- ☆41Updated 6 months ago
- ☆26Updated 3 months ago
- [ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Y…☆33Updated 2 years ago
- ☆28Updated last year
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆41Updated last year
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆41Updated last year
- SoCC'20 and TPDS'21: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning.☆51Updated 2 years ago
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.☆52Updated 2 years ago
- Graph Sampling using GPU☆52Updated 3 years ago
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference☆80Updated last week
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022☆31Updated last year
- Distributed Multi-GPU GNN Framework☆36Updated 5 years ago
- A Framework for Graph Sampling and Random Walk on GPUs.☆38Updated 10 months ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.☆69Updated 2 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆40Updated 4 years ago
- Set of datasets for the deep learning recommendation model (DLRM).☆48Updated 2 years ago
- ☆47Updated 3 years ago
- Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)☆44Updated 2 years ago
- A Factored System for Sample-based GNN Training over GPUs☆44Updated 2 years ago
- FlashMob is a shared-memory random walk system.☆32Updated 2 years ago
- Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.☆106Updated 3 months ago
- Query-Adaptive Vector Search☆65Updated 3 weeks ago
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆50Updated 4 months ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆59Updated 3 years ago
- ☆31Updated last year
- FGNN's artifact evaluation (EuroSys 2022)☆17Updated 3 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆30Updated 3 years ago
- ☆112Updated 4 years ago
- GPU TopK Benchmark☆17Updated last year