Dense X Retrieval: What Retrieval Granularity Should We Use?
☆168Jan 8, 2024Updated 2 years ago
Alternatives and similar repositories for factoid-wiki
Users that are interested in factoid-wiki are comparing it to the libraries listed below
Sorting:
- Enhancing Retrieval and Managing Retrieval: 4-Module Synergy☆23Dec 7, 2024Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 2 months ago
- This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João…☆92Feb 9, 2026Updated last month
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 3 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"☆40Jun 9, 2023Updated 2 years ago
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆667Nov 20, 2023Updated 2 years ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆52Jul 3, 2024Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 9 months ago
- HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels☆571Dec 6, 2024Updated last year
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆62May 24, 2023Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆582Updated this week
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Dec 27, 2022Updated 3 years ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Dec 2, 2024Updated last year
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,602Sep 3, 2024Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆87Jul 31, 2024Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,327May 25, 2024Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆240Feb 26, 2026Updated last week
- Scalable training for dense retrieval models.☆298Jun 10, 2025Updated 8 months ago
- ☆140Aug 21, 2023Updated 2 years ago
- ☆187Jul 2, 2025Updated 8 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Jul 16, 2025Updated 7 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,026Updated this week
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆253Jun 12, 2024Updated last year
- ☆29Apr 8, 2025Updated 11 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆23Aug 18, 2024Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,868May 17, 2025Updated 9 months ago
- Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.☆292Jan 29, 2023Updated 3 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆273Sep 25, 2025Updated 5 months ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆196Apr 6, 2024Updated last year
- Code for KaLM-Embedding models☆114Jun 30, 2025Updated 8 months ago
- ☆34Dec 18, 2025Updated 2 months ago
- ☆26Nov 21, 2022Updated 3 years ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆89Nov 4, 2025Updated 4 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 4 months ago
- An Open-Source Package for Information Retrieval☆168Mar 2, 2026Updated last week