Improving the effectiveness Lucene's BM25 (and testing it using Yahoo! Answers and Stack Overflow collections)
☆16Feb 26, 2022Updated 4 years ago
Alternatives and similar repositories for AccurateLuceneBM25
Users that are interested in AccurateLuceneBM25 are comparing it to the libraries listed below
Sorting:
- A clone of indri-5.12 with minor customizations.☆25Sep 23, 2024Updated last year
- Open-Source Information Retrieval Reproducibility Challenge☆51Jan 11, 2016Updated 10 years ago
- Experimental Git Mirror of "https://sourceforge.net/p/lemur/galago" using "https://github.com/felipec/git-remote-hg"☆13Dec 17, 2020Updated 5 years ago
- Lucene for Information Retrieval☆50Jan 1, 2023Updated 3 years ago
- TREC Real-Time Summarization Tools☆15Jul 19, 2017Updated 8 years ago
- Python binding to the KrovetzStemmer package (C++ version)☆13Feb 12, 2023Updated 3 years ago
- Tools for working with the TREC CAR dataset.☆36Jul 12, 2025Updated 7 months ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Nov 16, 2017Updated 8 years ago
- ☆18Jul 20, 2021Updated 4 years ago
- scripts to download and standardize trec query and document sets☆48Aug 7, 2019Updated 6 years ago
- Tool for comparing two ranked lists (TREC run files)☆20Nov 9, 2022Updated 3 years ago
- Python implementation of nonparametric nearest-neighbor-based estimators for divergences between distributions.☆48Mar 13, 2017Updated 8 years ago
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".☆52Mar 3, 2025Updated last year
- Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19☆48Apr 30, 2019Updated 6 years ago
- Exploratory topic modeling with distributional semantics and interactive visualization☆18Jan 11, 2017Updated 9 years ago
- Flexible classic and NeurAl Retrieval Toolkit☆223Jun 28, 2025Updated 8 months ago
- Natural Logic Inference for Common Sense Reasoning☆61Nov 6, 2018Updated 7 years ago
- Reproducibility of the TAGME entity linking system☆60May 10, 2019Updated 6 years ago
- A tool for scraping tweet ids from the Twitter website.☆31Feb 23, 2017Updated 9 years ago
- For the paper: "Semi-Supervised Structured Prediction with Neural CRF Autoencoder"☆26Aug 7, 2017Updated 8 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Sep 27, 2016Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Aug 8, 2016Updated 9 years ago
- High Dimensional Approximate Near(est) Neighbor☆34Sep 5, 2017Updated 8 years ago
- SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model☆36Aug 2, 2017Updated 8 years ago
- OSoMe API mashups☆11Jan 29, 2019Updated 7 years ago
- Rank Aggregation Algorithms☆12Jul 22, 2014Updated 11 years ago
- Exploiting entity linking in queries for entity retrieval☆81May 10, 2019Updated 6 years ago
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf☆38Aug 23, 2018Updated 7 years ago
- Vespa application making an index of the CORD-19 dataset.☆40Jul 8, 2025Updated 8 months ago
- Elasticsearch plugin for Sentiment Analysis using Stanford CoreNLP☆11Oct 17, 2018Updated 7 years ago
- Implementation of data dimensionality reduction algorithms SVD and CUR without using library functions.☆10Jul 24, 2017Updated 8 years ago
- Kamusal veri kaynaklari ve bu verileri programatik olarak cekmek icin olusturulmus depo☆11Jan 4, 2021Updated 5 years ago
- Fielded Sequential Dependence Model (code and runs)☆32Dec 23, 2015Updated 10 years ago
- Command-line tool for building Gephi force-directed graph diagrams.☆10Nov 10, 2017Updated 8 years ago
- Workshop materials for scraping Twitter with Python☆13May 25, 2016Updated 9 years ago
- MS Marco Entity Annotations Disambiguation☆13May 19, 2023Updated 2 years ago
- Machine learning workshop presented at GLBIO 2016☆11May 17, 2016Updated 9 years ago
- ☆11Aug 4, 2022Updated 3 years ago