scripts to download and standardize trec query and document sets
☆48Aug 7, 2019Updated 6 years ago
Alternatives and similar repositories for trec-data
Users that are interested in trec-data are comparing it to the libraries listed below
Sorting:
- Standalone Neural Ranking Model (SNRM)☆76Dec 26, 2018Updated 7 years ago
- Knowledge graph Entity and Word Embeddings for Retrieval☆11Nov 19, 2021Updated 4 years ago
- Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19☆48Apr 30, 2019Updated 6 years ago
- DBpedia-Entity v2: A Test Collection for Entity Search☆62Oct 16, 2020Updated 5 years ago
- ☆160Jun 9, 2021Updated 4 years ago
- The Tweets2013 Internet Archive collection☆10Aug 7, 2020Updated 5 years ago
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".☆52Mar 3, 2025Updated last year
- A large scale feature extraction tool for text-based machine learning☆32Sep 6, 2022Updated 3 years ago
- NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval☆31Nov 26, 2018Updated 7 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆51Jan 11, 2016Updated 10 years ago
- R library for common information retrieval metrics☆14Jun 5, 2023Updated 2 years ago
- The code for COPACRR Neural IR model.☆37Feb 6, 2018Updated 8 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- A repository for Neural Document Ranking Models.☆83Sep 15, 2018Updated 7 years ago
- Resources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"☆158Jul 8, 2018Updated 7 years ago
- Minimalistic BM25 search engine in C/C++, Java, and nearly 20 other languages☆22Jun 19, 2024Updated last year
- Tool for comparing two ranked lists (TREC run files)☆20Nov 9, 2022Updated 3 years ago
- Tools for working with the TREC CAR dataset.☆36Jul 12, 2025Updated 8 months ago
- Experimental Git Mirror of "https://sourceforge.net/p/lemur/galago" using "https://github.com/felipec/git-remote-hg"☆13Dec 17, 2020Updated 5 years ago
- A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.☆23Nov 9, 2025Updated 4 months ago
- Fielded Sequential Dependence Model (code and runs)☆32Dec 23, 2015Updated 10 years ago
- IAI Style Guide☆11Jun 27, 2025Updated 8 months ago
- Improving the effectiveness Lucene's BM25 (and testing it using Yahoo! Answers and Stack Overflow collections)☆16Feb 26, 2022Updated 4 years ago
- ☆34Feb 17, 2021Updated 5 years ago
- ☆13Nov 15, 2017Updated 8 years ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆345Oct 10, 2023Updated 2 years ago
- Resources for the Tutorial on "Utilizing Knowledge Bases in Text-centric Information Retrieval"☆25Sep 18, 2016Updated 9 years ago
- An end-to-end neural ad-hoc ranking pipeline.☆153Jul 13, 2025Updated 8 months ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- Tools relating to the CC-News-En Collection☆20Dec 8, 2023Updated 2 years ago
- Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET☆190Oct 24, 2019Updated 6 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆22Nov 28, 2021Updated 4 years ago
- ☆20Jan 16, 2020Updated 6 years ago
- Jig for the Open-Source IR Replicability Challenge (OSIRRC)☆13Dec 8, 2022Updated 3 years ago
- Dynamic Entity Summarization (DynES)☆20May 10, 2019Updated 6 years ago
- Neural-IR-Explorer: A Content-Focused Tool to Explore Neural Re-Ranking Results☆32Dec 13, 2019Updated 6 years ago
- ☆50Sep 3, 2019Updated 6 years ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,106Mar 15, 2026Updated last week
- Evaluation software used in the Text Retrieval Conference☆276Mar 9, 2026Updated last week