MooreThreads / TurboRAG
☆79 Updated 5 months ago
Alternatives and similar repositories for TurboRAG:
Users who are interested in TurboRAG are comparing it to the libraries listed below.
- Modular and structured prompt caching for low-latency LLM inference ☆92 Updated 6 months ago
- PGRAG ☆48 Updated 9 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆82 Updated 3 months ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods. ☆153 Updated 5 months ago
- ☆162 Updated last month
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper ☆136 Updated 9 months ago
- ☆30 Updated 9 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering ☆102 Updated 3 months ago
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration ☆31 Updated last week
- ☆94 Updated 5 months ago
- Code for Parametric RAG, SIGIR 2025 Full Paper ☆161 Updated last week
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization ☆131 Updated 4 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆137 Updated 10 months ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain" ☆57 Updated 4 months ago
- ☆47 Updated 4 months ago
- ☆40 Updated 2 months ago
- Simple extension on vLLM to help you speed up reasoning model without training. ☆149 Updated last week
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆296 Updated 6 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆140 Updated 4 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆244 Updated 3 weeks ago
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers ☆65 Updated 11 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception ☆156 Updated last month
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main) ☆103 Updated last month
- FuseAI Project ☆86 Updated 3 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆132 Updated 11 months ago
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe… ☆229 Updated last month
- [NIPS'24] UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis ☆37 Updated 2 months ago
- ☆56 Updated 6 months ago
- A pipeline for LLM knowledge distillation ☆102 Updated last month
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs). ☆243 Updated last year