Hadoop tools for manipulating ClueWeb collections
☆26Jul 15, 2016Updated 9 years ago
Alternatives and similar repositories for clueweb
Users that are interested in clueweb are comparing it to the libraries listed below
Sorting:
- A toolkit for simulating interactive information retrieval☆21Sep 7, 2018Updated 7 years ago
- Rank-Biased Precision, Overlap, Recall, and Alignment☆12Feb 18, 2025Updated last year
- TREC Core track☆11Jul 5, 2017Updated 8 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Jul 7, 2019Updated 6 years ago
- Jig for the Open-Source IR Replicability Challenge (OSIRRC)☆13Dec 8, 2022Updated 3 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- Systematic Review Query Visualisation and Understanding Interface☆17Dec 5, 2025Updated 2 months ago
- Toolkit for domain-specific information retrieval experimentation☆19Updated this week
- ☆25Feb 20, 2026Updated last week
- Tool for comparing two ranked lists (TREC run files)☆20Nov 9, 2022Updated 3 years ago
- Fusion for TREC run files with popular fusion techniques☆21Aug 26, 2022Updated 3 years ago
- ☆19Jan 16, 2020Updated 6 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆51Jan 11, 2016Updated 10 years ago
- Tools for the TREC CAsT benchmark☆28Dec 15, 2022Updated 3 years ago
- A Python interface to PISA☆37Sep 23, 2025Updated 5 months ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Jan 14, 2023Updated 3 years ago
- Common Index File Format to to support interoperability between open-source IR engines☆40Sep 19, 2024Updated last year
- A RankLib based Solr Learning to Rank Plugin☆29Jul 7, 2022Updated 3 years ago
- Common web archive utility code.☆61Feb 6, 2026Updated 3 weeks ago
- The classic movies redux with machine learning using TensorFlow and Keras.☆11Feb 12, 2019Updated 7 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86May 12, 2021Updated 4 years ago
- A toolkit for end-to-end neural ad hoc retrieval☆97Aug 20, 2024Updated last year
- Website for the TREC Deep Learning Track 2019☆86Jun 12, 2023Updated 2 years ago
- Pivotal GemFire XD☆13Nov 18, 2020Updated 5 years ago
- The MSR FastRDFStore Package is designed for creating an in-memory index of RDF triples, implemented as a WCF service in C#, and consists…☆41Jun 12, 2023Updated 2 years ago
- Example application that checks the Weather Underground API and controls a physical weather dashboard using a cloudBit.☆11Jan 6, 2017Updated 9 years ago
- Code and data for the Walert large language model-based chatbot☆12Aug 14, 2025Updated 6 months ago
- IPython Notebook for Sentiment Classification☆10Nov 12, 2014Updated 11 years ago
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.☆11Updated this week
- Fake NEWS detector using LIAR dataset.☆11Aug 19, 2019Updated 6 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- Identifying Nuances in Fake News vs. Satire: Using Semantic and Linguistic Cues (NLP4IF, EMNLP-IJCNLP 2019)☆11Dec 21, 2020Updated 5 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- Security research organization dedicated to finding low hanging, critical, vulnerabilities.☆15May 12, 2022Updated 3 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Dec 13, 2018Updated 7 years ago
- Containerfile for the Vanilla OS Desktop+Nvidia image.☆16Feb 5, 2026Updated 3 weeks ago
- A Hadoop toolkit for web-scale information retrieval research☆85Dec 12, 2014Updated 11 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- pyndri is a Python interface to the Indri search engine.☆89Jun 21, 2022Updated 3 years ago