google/rago

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google/rago)

google / rago

☆31

Alternatives and similar repositories for rago

Users that are interested in rago are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Leo9660 / HedraRAG_AE
View on GitHub
Artifact Evaluation for SOSP 2025
☆21Aug 16, 2025Updated 11 months ago
abhibambhaniya / GenZ-LLM-Analyzer
View on GitHub
LLM Inference analyzer for different hardware platforms
☆119Jun 23, 2026Updated 3 weeks ago
junyimei / flowwalker-artifact
View on GitHub
Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"
☆11Oct 23, 2024Updated last year
naimengye / speculative-action
View on GitHub
☆30Mar 9, 2026Updated 4 months ago
fpgasystems / Chameleon-RAG-Acceleration
View on GitHub
☆23Jun 1, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AutonomicPerfectionist / PipeInfer
View on GitHub
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
☆32Nov 16, 2024Updated last year
hku-systems / naspipe
View on GitHub
☆14Jan 12, 2022Updated 4 years ago
caoshiyi / artifacts
View on GitHub
☆40Nov 28, 2024Updated last year
ParCIS / FlashSparse
View on GitHub
FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…
☆39Oct 5, 2025Updated 9 months ago
stanford-mast / cedar
View on GitHub
☆19Oct 22, 2024Updated last year
TNAS-DCS / TNAS-DCS
View on GitHub
☆13Aug 9, 2022Updated 3 years ago
shin-yamashita / 6th-AI-Edge-Contest
View on GitHub
RTL implementation of TFlite FPGA accelerator and RISC-V controller. 3D Object Detection based on LiDAR Point Clouds.
☆17Jun 24, 2026Updated 3 weeks ago
MachineLearningSystem / 25ASPLOS-Medusa
View on GitHub
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆12Nov 8, 2024Updated last year
HPMLL / DTC-SpMM_ASPLOS24
View on GitHub
☆47Jun 19, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
dywsjtu / apparate
View on GitHub
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
☆24Nov 21, 2024Updated last year
eddiegaoo / Apt-Serve
View on GitHub
☆21Jun 9, 2025Updated last year
LLMServe / hydraserve
View on GitHub
☆20May 11, 2026Updated 2 months ago
mediroozmeh / FPGA_BitonicSorting
View on GitHub
Implementation of BitonicSorting algorithm on FPGA through SDAccel using Opencl as source code
☆17Nov 21, 2016Updated 9 years ago
AutomataLab / JSONSki
View on GitHub
JSONPath Streaming with Bit-Parallel Fast-Forwarding
☆33Oct 10, 2024Updated last year
Xilinx / xup_aie_training
View on GitHub
Hands-on experience programming AI Engines using Vitis Unified Software Platform
☆42Jul 24, 2024Updated last year
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
AlbertoParravicini / approximate-spmv-topk
View on GitHub
Public repostory for the DAC 2021 paper "Scaling up HBM Efficiency of Top-K SpMV forApproximate Embedding Similarity on FPGAs"
☆16Aug 29, 2021Updated 4 years ago
ziliuziliu / FaaSGraph
View on GitHub
☆22Mar 2, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
redbird-arch / isca2025-chimera-artifact
View on GitHub
Artifact of Chimera
☆18May 6, 2025Updated last year
ParCoreLab / aCG
View on GitHub
GPU-accelerated linear solvers based on the conjugate gradient (CG) method, supporting NVIDIA and AMD GPUs with GPU-aware MPI, NCCL, RCCL…
☆16Mar 14, 2026Updated 4 months ago
araij / rabbit_order
View on GitHub
☆49Jan 30, 2026Updated 5 months ago
All-less / faas-scheduling-benchmark
View on GitHub
A benchmark suite for evaluating FaaS scheduler.
☆23Nov 5, 2022Updated 3 years ago
microsoft / RetrievalAttention
View on GitHub
[VLDB 26, NeurIPS 25] Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.
☆147Feb 22, 2026Updated 4 months ago
sakura-ysy / rdma-rpc
View on GitHub
C++ RPC based on RDMA
☆13Sep 12, 2023Updated 2 years ago
salehjg / DeepPoint-V2-FPGA
View on GitHub
The code repository of DGCNN on FPGA: Acceleration of The Point Cloud Classifier Using FPGAs
☆17Mar 6, 2023Updated 3 years ago
Scientific-Computing-Lab / STREAMer
View on GitHub
STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth
☆18Aug 21, 2023Updated 2 years ago
uwdb / vss
View on GitHub
VSS: A Storage System for Video Analytics
☆13Jul 9, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SFU-HiAccel / HiSpMV
View on GitHub
[TRETS 2025][FPGA 2024] FPGA Accelerator for Imbalanced SpMV using HLS
☆23Aug 24, 2025Updated 10 months ago
luzhixing12345 / klinux
View on GitHub
linux 内核技术文档
☆16Apr 27, 2026Updated 2 months ago
pmem / pmem.github.io
View on GitHub
The pmem.io Website
☆17Jan 20, 2026Updated 6 months ago
YukeWang96 / TC-GNN_ATC23
View on GitHub
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58Oct 16, 2023Updated 2 years ago
readwrite112 / AGAThA
View on GitHub
PPoPP24 AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping
☆22May 8, 2024Updated 2 years ago
tensorcast-ai / tensorcast
View on GitHub
The high-performance distributed tensor layer — load once, share everywhere.
☆30Jun 23, 2026Updated 3 weeks ago
SusCom-Lab / ZSMerge
View on GitHub
☆23Sep 24, 2025Updated 9 months ago