ghpaetzold / massalign
Alignment and annotation for comparable documents.
☆22 Updated 7 years ago
Alternatives and similar repositories for massalign
Users who are interested in massalign are comparing it to the libraries listed below.
- Efficient Low-Memory Aligner ☆146 Updated 10 months ago
- Text Simplification System and Dataset ☆125 Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆38 Updated 3 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory ☆46 Updated 3 months ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish. ☆10 Updated 4 years ago
- End-to-end shallow discourse parser ☆23 Updated 2 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM) ☆102 Updated 3 weeks ago
- ☆32 Updated 4 years ago
- ParCourE - Parallel Corpus Explorer ☆12 Updated 3 years ago
- Exploring Neural Text Simplification ☆73 Updated 7 years ago
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 Updated 4 years ago
- Efficient Markov Chain word alignment ☆52 Updated 4 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses) ☆68 Updated 2 weeks ago
- Easier Automatic Sentence Simplification Evaluation ☆162 Updated 2 years ago
- Appraise evaluation system for manual evaluation of machine translation output ☆77 Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories. ☆83 Updated 3 years ago
- Appraise code used as part of WMT21 human evaluation campaign ☆29 Updated 2 weeks ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit ☆58 Updated 3 months ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies. ☆28 Updated 6 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆152 Updated last week
- An initiative to collect and distribute resources for co-reference resolution in a unified standard. ☆25 Updated last year
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology… ☆223 Updated 2 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o… ☆103 Updated 2 years ago
- This is the reference implementation of commonly used coreference metrics. ☆76 Updated 7 years ago
- Data and code used in the 2015 ACL paper, "Ground Truth for Grammatical Error Correction Metrics" ☆54 Updated 7 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann… ☆30 Updated 5 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing ☆102 Updated last year
- Mining Discourse Markers for Unsupervised Sentence Representation Learning ☆61 Updated 2 years ago
- ☆32 Updated 4 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) ☆17 Updated last year