apluslms / greedy-string-tilingLinks
Python package implementing the greedy string tiling algorithm for comparing string similarity
☆12Updated 2 years ago
Alternatives and similar repositories for greedy-string-tiling
Users that are interested in greedy-string-tiling are comparing it to the libraries listed below
Sorting:
- SciWING is a modern toolkit for scientific document processing from WING-NUS☆63Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 3 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- A Wikipedia-based summarization dataset☆14Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- A first cut into exploring the use of dependency links for building Text Graphs, that, among other things, with help of a centrality algo…☆31Updated last year
- Enhaced version of Wikiextrator: A wikipedia dumps extractor☆21Updated 3 weeks ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆21Updated 3 years ago
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith☆14Updated 4 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆47Updated 10 months ago
- Dataset accompanying the SPECTER model☆140Updated 2 years ago
- Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles☆46Updated last year
- Submission archive for the MS MARCO passage ranking leaderboard☆13Updated 2 years ago
- Bayesian Assessment of Hypotheses☆25Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆28Updated last year
- ☆17Updated 2 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- Knowledge graph based information retrieval☆13Updated 6 years ago
- ☆13Updated 3 years ago
- allennlp-light is a port of AllenNLP's core modules and nn portions into a standalone package with minimum dependencies☆55Updated 3 years ago
- Frame Semantic Parser based on T5 and FrameNet☆62Updated 2 years ago
- ☆29Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆105Updated last year
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 4 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆71Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆24Updated last year
- ☆46Updated 3 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Contrastive Fact Verification☆73Updated 3 years ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago