A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search
☆21Jul 22, 2025Updated 8 months ago
Alternatives and similar repositories for PathWeaver
Users that are interested in PathWeaver are comparing it to the libraries listed below
Sorting:
- PPoPP24 AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping☆22May 8, 2024Updated last year
- ☆21Jun 6, 2024Updated last year
- ☆28Nov 29, 2024Updated last year
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- ☆21Mar 15, 2026Updated last week
- Segmented Code Adjustment Quantization (SAQ)☆18Sep 22, 2025Updated 6 months ago
- ☆20Jun 1, 2025Updated 9 months ago
- ☆14Jan 12, 2022Updated 4 years ago
- C++ 实现 BP 神经网络识别手写数字数据集 MNIST☆14Jan 7, 2024Updated 2 years ago
- A new DRAM substrate that mitigates the excessive energy consumption from both (i) transmitting unused data on the memory channel and (i…☆14Aug 23, 2024Updated last year
- Source code for the paper: Accelerating Dynamic Graph Analytics on GPUs☆30Jun 19, 2023Updated 2 years ago
- ☆14Jan 20, 2025Updated last year
- Official implementation for paper "Navigating Labels and Vectors: A Unified Approach to Filtered Approximate Nearest Neighbor Search"☆35Dec 21, 2024Updated last year
- ☆33Sep 9, 2020Updated 5 years ago
- A low-latency, billion-scale, and updatable graph-based vector store on SSD.☆102Feb 4, 2026Updated last month
- A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo…☆22Feb 7, 2025Updated last year
- ☆13Oct 6, 2024Updated last year
- ☆24Apr 4, 2024Updated last year
- ☆32Jan 22, 2025Updated last year
- Block-based Approximate Nearest Neighbor☆35Nov 1, 2021Updated 4 years ago
- Python CFFI Binding around SuiteSparse:GraphBLAS☆24Sep 14, 2025Updated 6 months ago
- ucas course information for students majoring in computer architecture.☆12Jul 7, 2024Updated last year
- ☆13Jan 28, 2026Updated last month
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆53Jul 21, 2025Updated 8 months ago
- Deft: A Scalable Tree Index for Disaggregated Memory☆23Apr 23, 2025Updated 10 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆129May 3, 2025Updated 10 months ago
- Codes of the paper "Time Constrained Continuous Subgraph Search Over Streaming Graphs. ICDE 2019: 1082-1093". Authors: Youhuan Li, Lei Zo…☆12Jun 18, 2021Updated 4 years ago
- ☆26Jan 21, 2026Updated 2 months ago
- Stock Market predictions with Prophet and FastAPI☆17Dec 22, 2021Updated 4 years ago
- Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design"☆33Apr 11, 2024Updated last year
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆87Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Feb 18, 2026Updated last month
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and…☆11Oct 22, 2021Updated 4 years ago
- Prompt format and padding guide for Llama 2☆12Sep 18, 2023Updated 2 years ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- A graph linear algebra overlay☆52Apr 26, 2023Updated 2 years ago
- ☆14Apr 24, 2024Updated last year
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆13Jun 28, 2025Updated 8 months ago
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆14Dec 9, 2024Updated last year