JMMackenzie / CC-News-Tools
Tools relating to the CC-News-En Collection
☆20 · Updated last year
Alternatives and similar repositories for CC-News-Tools
Users who are interested in CC-News-Tools are comparing it to the repositories listed below.
- HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface. ☆37 · Updated 2 years ago
- A toolkit for end-to-end neural ad hoc retrieval ☆96 · Updated 10 months ago
- Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19 ☆46 · Updated 6 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example ☆21 · Updated 3 years ago
- Fusion for TREC run files with popular fusion techniques ☆21 · Updated 2 years ago
- Scripts to download and standardize TREC query and document sets ☆48 · Updated 5 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors. ☆47 · Updated 4 years ago
- Anserini notebooks ☆69 · Updated 2 years ago
- Tools for the TREC CAsT benchmark ☆28 · Updated 2 years ago
- Tools for working with the TREC CAR dataset. ☆35 · Updated 3 years ago
- ☆34 · Updated 4 years ago
- Automatically detect errors in annotated corpora. ☆47 · Updated last year
- Standalone Neural Ranking Model (SNRM) ☆76 · Updated 6 years ago
- Information Retrieval Relevance Judging System ☆29 · Updated 3 years ago
- Submission archive for the MS MARCO document ranking leaderboard ☆30 · Updated last year
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling ☆59 · Updated 3 years ago
- Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators" ☆15 · Updated 3 years ago
- Source code of bison ☆26 · Updated 4 years ago
- ☆38 · Updated 2 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018) ☆77 · Updated 6 years ago
- An end-to-end neural ad-hoc ranking pipeline. ☆151 · Updated 2 months ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021). ☆76 · Updated 3 years ago
- Generalizing Natural Language Analysis through Span-relation Representations ☆91 · Updated 2 years ago
- Multi-stage passage ranking: monoBERT + duoBERT ☆112 · Updated 4 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆64 · Updated 4 years ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data… ☆54 · Updated 3 years ago
- Reproducibility of the TAGME entity linking system ☆60 · Updated 6 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆170 · Updated 4 years ago
- ☆54 · Updated 3 years ago
- Repository for KPTimes corpus ☆35 · Updated 4 months ago