Official repo to On the Generalization Ability of Retrieval-Enhanced Transformers
☆47Jun 4, 2024Updated 2 years ago
Alternatives and similar repositories for retro
Users that are interested in retro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Aug 23, 2024Updated last year
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆32Jun 14, 2024Updated 2 years ago
- How to plot for papers, slides, demos, etc.☆10Apr 7, 2022Updated 4 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- ☆103Nov 25, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A code base for Vexless☆17Mar 7, 2024Updated 2 years ago
- GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing.☆14Nov 8, 2023Updated 2 years ago
- ☆25Dec 1, 2020Updated 5 years ago
- IN2118 Databases Implementation on Modern CPU Architectures, SS 2020, TUM☆20Oct 10, 2020Updated 5 years ago
- ☆18May 30, 2025Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Dec 29, 2022Updated 3 years ago
- Efficient Memory-Augmented Transformers☆35Dec 5, 2022Updated 3 years ago
- Natural language understanding benchmarks for Norwegian☆14Aug 29, 2025Updated 9 months ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆877Oct 30, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression (DAC'25)☆30Feb 26, 2026Updated 3 months ago
- ☆11Aug 15, 2023Updated 2 years ago
- Modular and structured prompt caching for low-latency LLM inference☆114Nov 9, 2024Updated last year
- Graph accelerator on FPGAs and ASICs☆11Aug 16, 2018Updated 7 years ago
- A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo…☆22May 6, 2026Updated last month
- CoCo-Ex extracts meaningful concepts from natural language texts and maps them to conjunct concept nodes in ConceptNet, utilizing the max…☆59Dec 2, 2022Updated 3 years ago
- ☆13Jan 8, 2020Updated 6 years ago
- ☆19Mar 13, 2016Updated 10 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆37Updated this week
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆77Apr 27, 2026Updated last month
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆12Mar 27, 2025Updated last year
- GPU-Based Approximate Nearest Neighbor Search☆32May 15, 2026Updated last month
- Article: GPU-accelerated Proximity Graph Approximate Nearest Neighbor Search and Construction by Authors Yuanhang Yu, Dong Wen, Ying Zhan…☆24Jun 20, 2025Updated 11 months ago
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- ☆11Dec 8, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- FleetRec: Large-Scale Recommendation Inference on Hybrid GPU-FPGA Clusters☆17May 26, 2021Updated 5 years ago
- ☆11Dec 29, 2023Updated 2 years ago
- Evaluate state-of-the-art GPU joins☆14Nov 29, 2023Updated 2 years ago
- Python binding for the G'MIC Image Processing Framework☆11Nov 14, 2025Updated 7 months ago
- ☆19May 16, 2024Updated 2 years ago
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference☆91Dec 7, 2025Updated 6 months ago
- A library of speech gadgets.☆15Oct 15, 2022Updated 3 years ago