senisioi / enntt-release
This is a monolingual English corpus of native, non-native and (human) translated texts extracted from the European Parliament.
☆9Updated 3 years ago
Alternatives and similar repositories for enntt-release:
Users that are interested in enntt-release are comparing it to the libraries listed below
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- ParCourE - Parallel Corpus Explorer☆12Updated 3 years ago
- Repository for DISRPT2023 shared task☆17Updated 8 months ago
- Appraise evaluation system for manual evaluation of machine translation output☆74Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆30Updated last month
- Appraise code used as part of WMT21 human evaluation campaign☆24Updated last month
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- Automated Semantic Analysis of Discourse Markers☆10Updated 2 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- Exploring Neural Text Simplification☆73Updated 7 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 2 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Efficient Low-Memory Aligner☆143Updated 3 months ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆24Updated 11 months ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆20Updated 5 years ago
- Data Sets and Models for Evaluation of Lexical Semantic Change Detection☆28Updated 2 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- ☆23Updated 5 years ago
- Code and data for paper Colorless Green Recurrent Networks Dream Hierarchically☆92Updated 3 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆16Updated 9 months ago
- End-to-end shallow discourse parser☆20Updated last year
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- a tool for calcualting character n-gram F score☆72Updated 2 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Updated 6 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Updated 4 years ago