A curated list of awesome papers about information retrieval(IR) in the age of large language model(LLM). These include retrieval augmented large language model, large language model for information retrieval, and so on.
☆78Aug 19, 2024Updated last year
Alternatives and similar repositories for Awesome-Information-Retrieval-in-the-Age-of-Large-Language-Model
Users that are interested in Awesome-Information-Retrieval-in-the-Age-of-Large-Language-Model are comparing it to the libraries listed below
Sorting:
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆337Jun 17, 2023Updated 2 years ago
- This is the repo for the survey of LLM4IR.☆532Nov 13, 2025Updated 4 months ago
- ☆30Sep 25, 2024Updated last year
- A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).☆677Jan 7, 2024Updated 2 years ago
- WSDM'2021, PROP and SIGIR'2021,B-PROP☆110May 18, 2023Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆33Dec 9, 2025Updated 3 months ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆734Jan 26, 2026Updated last month
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- ☆27Oct 7, 2025Updated 5 months ago
- ☆720Oct 7, 2025Updated 5 months ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Submission archive for the MS MARCO document ranking leaderboard☆31Oct 9, 2023Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- The awesome agents in the era of large language models☆71Nov 18, 2023Updated 2 years ago
- Submission archive for the MS MARCO passage ranking leaderboard☆13Apr 21, 2023Updated 2 years ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆49Dec 14, 2023Updated 2 years ago
- NAACL2021 - COIL Contextualized Lexical Retriever☆157Jul 27, 2021Updated 4 years ago
- A curated list of resources dedicated to retrieval-augmented generation (RAG).☆130Oct 31, 2025Updated 4 months ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- ☆26Apr 11, 2024Updated last year
- Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)☆22Aug 28, 2023Updated 2 years ago
- ☆21Apr 17, 2023Updated 2 years ago
- Code for Paper "PMAES: Prompt-mapping Contrastive Learning for Cross-prompt Automated Essay Scoring" ACL2023☆11Oct 6, 2023Updated 2 years ago
- ☆39Nov 21, 2022Updated 3 years ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- Official implementation of "Graph Signal Diffusion Model for Collaborative Filtering" (SIGIR 2024)☆17May 31, 2024Updated last year
- Codebase for RetroMAE and beyond.☆272Jun 7, 2024Updated last year
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 2 months ago
- A collection of papers tackling automatic fact-checking (particularly of AI-generated content)☆14Nov 3, 2023Updated 2 years ago
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆16Apr 24, 2025Updated 10 months ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Oct 24, 2023Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 5 years ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated last year
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 4 years ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆15Nov 25, 2025Updated 3 months ago
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14May 6, 2023Updated 2 years ago