☆94 Nov 25, 2024 Updated last year
Alternatives and similar repositories for TurboRAG
Users that are interested in TurboRAG are comparing it to the libraries listed below
- ☆27 Apr 17, 2025 Updated 10 months ago
- ☆21 Apr 17, 2025 Updated 10 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025) ☆30 Jun 14, 2024 Updated last year
- ☆165 Jul 15, 2025 Updated 7 months ago
- ☆19 Mar 13, 2016 Updated 9 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆13 Mar 30, 2024 Updated last year
- Vectorized intersections (research code) ☆16 Jan 13, 2017 Updated 9 years ago
- ☆20 Apr 3, 2025 Updated 10 months ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding ☆144 Dec 4, 2024 Updated last year
- A dataset of news headlines for detecting causalities ☆14 May 9, 2022 Updated 3 years ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents ☆33 Feb 1, 2026 Updated 3 weeks ago
- An experimentation platform for LLM inference optimisation ☆36 Sep 19, 2024 Updated last year
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering. ☆19 Dec 6, 2024 Updated last year
- ☆16 Jan 24, 2025 Updated last year
- ☆16 Jul 23, 2024 Updated last year
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference ☆82 Dec 7, 2025 Updated 2 months ago
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention ☆52 Aug 6, 2025 Updated 6 months ago
- Manages vllm-nccl dependency ☆17 Jun 3, 2024 Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆30 Nov 12, 2024 Updated last year
- ☆19 Mar 10, 2025 Updated 11 months ago
- RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering ☆32 Oct 8, 2025 Updated 4 months ago
- ☆32 Oct 13, 2025 Updated 4 months ago
- ☆20 Jul 11, 2023 Updated 2 years ago
- a survey on deep research ☆47 Sep 9, 2025 Updated 5 months ago
- ☆50 May 22, 2024 Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral] ☆42 Aug 25, 2025 Updated 6 months ago
- ☆150 Oct 9, 2024 Updated last year
- ☆34 Oct 9, 2025 Updated 4 months ago
- ☆46 Jun 24, 2025 Updated 8 months ago
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model ☆22 Oct 25, 2023 Updated 2 years ago
- Simulation code for the LHD cache replacement policy as published in NSDI 2018. ☆25 Jul 23, 2018 Updated 7 years ago
- ☆31 Jun 12, 2024 Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 Jun 28, 2025 Updated 8 months ago
- Revision of official yolov7-pose to support custom dataset for keypoint detection ☆11 Nov 12, 2023 Updated 2 years ago
- ☆28 May 24, 2025 Updated 9 months ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 Updated this week
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention… ☆1,188 Sep 30, 2025 Updated 5 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 Apr 9, 2025 Updated 10 months ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation ☆32 Nov 16, 2024 Updated last year