scripts to download and standardize trec query and document sets
☆48Aug 7, 2019Updated 6 years ago
Alternatives and similar repositories for trec-data
Users that are interested in trec-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A clone of indri-5.12 with minor customizations.☆25Sep 23, 2024Updated last year
- Knowledge graph Entity and Word Embeddings for Retrieval☆11Nov 19, 2021Updated 4 years ago
- Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19☆48Apr 30, 2019Updated 6 years ago
- DBpedia-Entity v2: A Test Collection for Entity Search☆62Oct 16, 2020Updated 5 years ago
- ☆160Jun 9, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Joint Optimization of Cascade Ranking Models (WSDM 19)☆13Jun 21, 2022Updated 3 years ago
- The Tweets2013 Internet Archive collection☆10Aug 7, 2020Updated 5 years ago
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".☆52Mar 3, 2025Updated last year
- A large scale feature extraction tool for text-based machine learning☆32Sep 6, 2022Updated 3 years ago
- NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval☆31Nov 26, 2018Updated 7 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆51Jan 11, 2016Updated 10 years ago
- R library for common information retrieval metrics☆14Jun 5, 2023Updated 2 years ago
- The code for COPACRR Neural IR model.☆37Feb 6, 2018Updated 8 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A toolkit for simulating interactive information retrieval☆21Sep 7, 2018Updated 7 years ago
- Meta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019)☆12May 25, 2019Updated 6 years ago
- A repository for Neural Document Ranking Models.☆83Sep 15, 2018Updated 7 years ago
- SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model☆36Aug 2, 2017Updated 8 years ago
- Minimalistic BM25 search engine in C/C++, Java, and nearly 20 other languages☆21Jun 19, 2024Updated last year
- Experimental Git Mirror of "https://sourceforge.net/p/lemur/galago" using "https://github.com/felipec/git-remote-hg"☆13Dec 17, 2020Updated 5 years ago
- A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.☆23Nov 9, 2025Updated 5 months ago
- Fielded Sequential Dependence Model (code and runs)☆32Dec 23, 2015Updated 10 years ago
- ☆63Sep 10, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Improving the effectiveness Lucene's BM25 (and testing it using Yahoo! Answers and Stack Overflow collections)☆16Feb 26, 2022Updated 4 years ago
- ☆34Feb 17, 2021Updated 5 years ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆346Oct 10, 2023Updated 2 years ago
- Terrier IR Platform☆273Mar 1, 2026Updated last month
- Resources for the Tutorial on "Utilizing Knowledge Bases in Text-centric Information Retrieval"☆25Sep 18, 2016Updated 9 years ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- Data and Code on Response Ranking with Deep Matching Networks and External Knowledge in Information-seeking Conversation Systems (SIGIR 2…☆71Jul 28, 2020Updated 5 years ago
- Tools relating to the CC-News-En Collection☆20Dec 8, 2023Updated 2 years ago
- Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET☆190Oct 24, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Test Collection of Computer Science Papers for Faceted Query by Example☆23Nov 28, 2021Updated 4 years ago
- ☆20Jan 16, 2020Updated 6 years ago
- Jig for the Open-Source IR Replicability Challenge (OSIRRC)☆13Dec 8, 2022Updated 3 years ago
- Dynamic Entity Summarization (DynES)☆20May 10, 2019Updated 6 years ago
- Neural-IR-Explorer: A Content-Focused Tool to Explore Neural Re-Ranking Results☆32Dec 13, 2019Updated 6 years ago
- ☆50Sep 3, 2019Updated 6 years ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,109Updated this week