TobiasNorlund/retro

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TobiasNorlund/retro)

TobiasNorlund / retro

Official repo to On the Generalization Ability of Retrieval-Enhanced Transformers

☆47

Alternatives and similar repositories for retro

Users that are interested in retro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

0x7o / RETRO-transformer
View on GitHub
Easy-to-use Retrieval-Enhanced Transformer implementation
☆10Sep 30, 2022Updated 3 years ago
amazon-science / piperag
View on GitHub
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)
☆32Jun 14, 2024Updated 2 years ago
WenqiJiang / SC-ANN-FPGA
View on GitHub
☆26May 30, 2025Updated last year
yale-sys / prompt-cache
View on GitHub
Modular and structured prompt caching for low-latency LLM inference
☆115Nov 9, 2024Updated last year
MooreThreads / TurboRAG
View on GitHub
☆102Nov 25, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lucidrains / RETRO-pytorch
View on GitHub
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆879Oct 30, 2023Updated 2 years ago
microsoft / RetrievalAttention
View on GitHub
[VLDB 26, NeurIPS 25] Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.
☆149Feb 22, 2026Updated 5 months ago
YZ-Cai / Unified-Navigating-Graph
View on GitHub
Official implementation for paper "Navigating Labels and Vectors: A Unified Approach to Filtered Approximate Nearest Neighbor Search"
☆38Dec 21, 2024Updated last year
Leo9660 / HedraRAG_AE
View on GitHub
Artifact Evaluation for SOSP 2025
☆22Aug 16, 2025Updated 11 months ago
JigaoLuo / modern_dbs
View on GitHub
IN2118 Databases Implementation on Modern CPU Architectures, SS 2020, TUM
☆20Oct 10, 2020Updated 5 years ago
Langboat / mengzi-retrieval-lm
View on GitHub
An experimental implementation of the retrieval-enhanced language model
☆74Dec 29, 2022Updated 3 years ago
uclnlp / EMAT
View on GitHub
Efficient Memory-Augmented Transformers
☆35Dec 5, 2022Updated 3 years ago
GraphBLAS / python-suitesparse-graphblas
View on GitHub
Python CFFI Binding around SuiteSparse:GraphBLAS
☆24Apr 27, 2026Updated 3 months ago
antgroup / cakekv
View on GitHub
☆39Mar 17, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TalnUPF / ConceptExtraction
View on GitHub
☆11Aug 15, 2023Updated 2 years ago
Liu-Cheng / graph_accelerator
View on GitHub
Graph accelerator on FPGAs and ASICs
☆11Aug 16, 2018Updated 7 years ago
CaucherWang / Steiner-hardness
View on GitHub
A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo…
☆22May 6, 2026Updated 2 months ago
MiuLab / Time-SLU
View on GitHub
Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding
☆14Sep 28, 2017Updated 8 years ago
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
zanmato1984 / cura
View on GitHub
CURA - CUDA Relational Algebra
☆31Jan 30, 2023Updated 3 years ago
tatHi / optok
View on GitHub
☆10Aug 26, 2021Updated 4 years ago
mila-iqia / Casande-RL
View on GitHub
Casande-RL
☆11May 9, 2023Updated 3 years ago
sjsafranek / facebook-automation
View on GitHub
Python3 / Selenium solution for automating your Facebook usage
☆12Jan 11, 2016Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Scientific-Computing-Lab / STREAMer
View on GitHub
STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth
☆18Aug 21, 2023Updated 2 years ago
Victorwz / tod_as_nlg
View on GitHub
Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".
☆14Apr 6, 2022Updated 4 years ago
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆14Mar 27, 2025Updated last year
AIS-SNU / PathWeaver
View on GitHub
A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search
☆21Jul 22, 2025Updated last year
tiannuo-yang / SearchAgent-X
View on GitHub
A High-Efficiency System of Large Language Model Based Search Agents
☆79Jul 2, 2025Updated last year
AI21Labs / in-context-ralm
View on GitHub
☆295Dec 20, 2023Updated 2 years ago
Heisenberg-Yin / DEG
View on GitHub
☆20May 30, 2025Updated last year
uw-mad-dash / decoding-speculative-decoding
View on GitHub
☆16Aug 19, 2024Updated last year
jackgo73 / oldb
View on GitHub
database
☆11Aug 31, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
jychen21 / Habana-LLM-Viewer
View on GitHub
☆13Jul 24, 2024Updated 2 years ago
AISys-01 / vllm-CachedAttention
View on GitHub
The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.
☆11Sep 19, 2024Updated last year
lizardll / ScalaBFS
View on GitHub
A Scalable BFS Accelerator on FPGA-HBM Platform
☆13Jul 30, 2021Updated 4 years ago
tsinghua-fib-lab / RoboScape
View on GitHub
☆26Jun 29, 2025Updated last year
maltanar / spmvaccsim
View on GitHub
A SystemC + DRAMSim2 simulator for exploring the SpMV hardware accelerator design space.
☆15Nov 9, 2014Updated 11 years ago
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆14Nov 22, 2023Updated 2 years ago
jxiw / UDO
View on GitHub
Universal Database Optimization using Reinforcement Learning
☆27Mar 23, 2023Updated 3 years ago