A curated list of awesome papers about information retrieval(IR) in the age of large language model(LLM). These include retrieval augmented large language model, large language model for information retrieval, and so on.
☆79Aug 19, 2024Updated last year
Alternatives and similar repositories for Awesome-Information-Retrieval-in-the-Age-of-Large-Language-Model
Users that are interested in Awesome-Information-Retrieval-in-the-Age-of-Large-Language-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆25Jul 19, 2024Updated last year
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆339Jun 17, 2023Updated 2 years ago
- ☆31Sep 25, 2024Updated last year
- This is the repo for the survey of LLM4IR.☆535Nov 13, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for "Neural Retrievers are Biased Towards LLM-Generated Content"☆14Oct 18, 2024Updated last year
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆220Jul 11, 2024Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 5 months ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆738May 18, 2026Updated last week
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- ☆721Oct 7, 2025Updated 7 months ago
- This project collects awesome resources (e.g., papers, open-source models) for large language model (LLM)☆254Mar 28, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Submission archive for the MS MARCO document ranking leaderboard☆31Oct 9, 2023Updated 2 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- A toolkit to automatically crawl the paper list and download paper pdfs of ACL Ahthology.☆11Nov 12, 2025Updated 6 months ago
- The awesome agents in the era of large language models☆72Nov 18, 2023Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆25Nov 25, 2025Updated 6 months ago
- Submission archive for the MS MARCO passage ranking leaderboard☆13Apr 21, 2023Updated 3 years ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆51Dec 14, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆26Apr 11, 2024Updated 2 years ago
- A curated list of resources dedicated to retrieval-augmented generation (RAG).☆134Oct 31, 2025Updated 6 months ago
- Document Ranking with Large Language Models.☆210Feb 14, 2026Updated 3 months ago
- Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)☆22Aug 28, 2023Updated 2 years ago
- ☆21Apr 17, 2023Updated 3 years ago
- ☆39Nov 21, 2022Updated 3 years ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- Codebase for RetroMAE and beyond.☆273Jun 7, 2024Updated last year
- ☆19May 16, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 4 months ago
- A collection of papers tackling automatic fact-checking (particularly of AI-generated content)☆13Nov 3, 2023Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Feb 5, 2026Updated 3 months ago
- 为了类脑计算☆18Jun 12, 2019Updated 6 years ago
- MultiSpanQA: A Dataset for Multi-Span Question Answering☆28Jan 24, 2026Updated 4 months ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 4 years ago