A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search
☆21Jul 22, 2025Updated 11 months ago
Alternatives and similar repositories for PathWeaver
Users that are interested in PathWeaver are comparing it to the libraries listed below.
- PPoPP24 AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping☆22May 8, 2024Updated 2 years ago
- ☆21Jun 6, 2024Updated 2 years ago
- ☆28Nov 29, 2024Updated last year
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- ☆30Apr 22, 2026Updated 2 months ago
- Segmented Code Adjustment Quantization (SAQ)☆25Sep 22, 2025Updated 9 months ago
- ☆23Jun 1, 2025Updated last year
- ☆14Jan 12, 2022Updated 4 years ago
- C++ implementation of a BP (backpropagation) neural network for recognizing the MNIST handwritten digit dataset☆14Jan 7, 2024Updated 2 years ago
- A new DRAM substrate that mitigates the excessive energy consumption from both (i) transmitting unused data on the memory channel and (i…☆14Aug 23, 2024Updated last year
- Source code for the paper: Accelerating Dynamic Graph Analytics on GPUs☆30Jun 19, 2023Updated 3 years ago
- ☆14Jan 20, 2025Updated last year
- Official implementation for paper "Navigating Labels and Vectors: A Unified Approach to Filtered Approximate Nearest Neighbor Search"☆37Dec 21, 2024Updated last year
- ☆33Sep 9, 2020Updated 5 years ago
- A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo…☆22May 6, 2026Updated last month
- ☆13Oct 6, 2024Updated last year
- A low-latency, billion-scale, and updatable graph-based vector store on SSD.☆140Jun 25, 2026Updated last week
- ☆24Apr 4, 2024Updated 2 years ago
- ☆33Jan 22, 2025Updated last year
- Block-based Approximate Nearest Neighbor☆35Nov 1, 2021Updated 4 years ago
- Python CFFI Binding around SuiteSparse:GraphBLAS☆24Apr 27, 2026Updated 2 months ago
- ucas course information for students majoring in computer architecture.☆12Jul 7, 2024Updated last year
- ☆13Jun 14, 2026Updated 2 weeks ago
- Deft: A Scalable Tree Index for Disaggregated Memory☆22Apr 23, 2025Updated last year
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆52Jul 21, 2025Updated 11 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆141May 3, 2025Updated last year
- ☆26Jan 21, 2026Updated 5 months ago
- Codes of the paper "Time Constrained Continuous Subgraph Search Over Streaming Graphs. ICDE 2019: 1082-1093". Authors: Youhuan Li, Lei Zo…☆12Jun 18, 2021Updated 5 years ago
- Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design"☆35Apr 11, 2024Updated 2 years ago
- Stock Market predictions with Prophet and FastAPI☆17Dec 22, 2021Updated 4 years ago
- High performance implementation of the WARP (SIGIR'25) retrieval engine.☆34May 21, 2026Updated last month
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆114Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆15Updated this week
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and…☆11Oct 22, 2021Updated 4 years ago
- Prompt format and padding guide for Llama 2☆12Sep 18, 2023Updated 2 years ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆12Oct 16, 2023Updated 2 years ago
- ☆14Apr 24, 2024Updated 2 years ago
- A graph linear algebra overlay☆52Apr 26, 2023Updated 3 years ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆12Jun 28, 2025Updated last year