ghpaetzold / massalign
Alignment and annotation for comparable documents.
☆22Updated 6 years ago
Alternatives and similar repositories for massalign:
Users that are interested in massalign are comparing it to the libraries listed below
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆43Updated 6 months ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- Appraise code used as part of WMT21 human evaluation campaign☆24Updated 2 months ago
- Efficient Low-Memory Aligner☆143Updated 3 months ago
- ParCourE - Parallel Corpus Explorer☆12Updated 3 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets☆40Updated 4 years ago
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…☆14Updated 2 months ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Updated 4 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆16Updated 9 months ago
- End-to-end shallow discourse parser☆20Updated last year
- ☆32Updated 3 years ago
- Repo for the simplified text alignment tools.☆21Updated 4 years ago
- Appraise evaluation system for manual evaluation of machine translation output☆74Updated 3 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆26Updated 4 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- ☆10Updated 2 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆67Updated 3 months ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆24Updated 11 months ago
- Repository for the Georgetown University Multilayer Corpus (GUM)☆94Updated this week
- ☆28Updated 10 months ago
- VerbNet semantic parser and related utilities☆36Updated 2 years ago
- Exploring Neural Text Simplification☆73Updated 7 years ago
- Repository for DISRPT2023 shared task☆16Updated 9 months ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Updated 3 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated last year
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- Repository for DISRPT2021 shared task☆15Updated 2 years ago