jonathandunn / corpus_similarity
Measure the similarity of text corpora for 74 languages
☆13Updated last year
Alternatives and similar repositories for corpus_similarity:
Users that are interested in corpus_similarity are comparing it to the libraries listed below
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆51Updated 2 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- The Universal Anaphora Scorer☆15Updated 8 months ago
- Parser for KAF NAF files written in Python☆16Updated 3 years ago
- Python library to work with ConceptNet offline☆10Updated 2 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆43Updated 6 months ago
- ☆32Updated 3 years ago
- ☆33Updated 3 years ago
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Updated 3 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Updated 7 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 2 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated 2 years ago
- List of corpora annotated for coreference for different languages☆17Updated 9 months ago
- A python 3 interface for BabelNet https://babelnet.org/☆32Updated 2 years ago
- Reference-less Quality Estimation of Text Simplification Systems☆50Updated last year
- 🌸 Train floret vectors☆18Updated 2 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Tool for parsing and converting various span encoding schemes.☆23Updated last year
- A python module to process data for Frame Semantic Parsing☆24Updated 4 years ago
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- A temporal ordering system for events and time expressions in written text.☆43Updated 3 years ago
- The Mueller Report Corpus V 0.1☆11Updated 4 years ago
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 9 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 10 months ago
- Interface for reading the Paraphrase Database (PPDB)☆24Updated 7 years ago
- Python framework for processing Universal Dependencies data☆57Updated last week
- This repository includes the code for neural DRS parsing☆27Updated last year
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆95Updated last year