jonathandunn / corpus_similarity
Measure the similarity of text corpora for 74 languages
☆13Updated last year
Alternatives and similar repositories for corpus_similarity:
Users that are interested in corpus_similarity are comparing it to the libraries listed below
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- Python library to work with ConceptNet offline☆10Updated 2 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Updated 6 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- Interface for reading the Paraphrase Database (PPDB)☆24Updated 6 years ago
- ☆32Updated 3 years ago
- Reference-less Quality Estimation of Text Simplification Systems☆49Updated last year
- List of corpora annotated for coreference for different languages☆17Updated 6 months ago
- Python wrapper for ClausIE.☆26Updated 3 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Extension of the SentenceSimplification project☆59Updated this week
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆44Updated 4 months ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated 11 months ago
- Python framework for processing Universal Dependencies data☆55Updated 3 weeks ago
- The Universal Anaphora Scorer☆15Updated 6 months ago
- Universal Proposition Banks for Multilingual Semantic Role Labeling☆101Updated 2 years ago
- A web interface to understand language-specific BERT-models☆17Updated 10 months ago
- A python module to process data for Frame Semantic Parsing☆23Updated 4 years ago
- ☆33Updated 3 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆66Updated 2 years ago
- Data and download script to accompany LREC2020 paper "Automated Fact-Checking of Claims from Wikipedia"☆13Updated last year
- spaCy-to-naf converter☆21Updated 9 months ago
- A python 3 interface for BabelNet https://babelnet.org/☆32Updated 2 years ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- Active learning for coreference resolution using discrete annotation☆12Updated 2 years ago
- An open information extraction system that provides compact extractions☆91Updated 3 years ago