anaistack / cefr-asag-corpusLinks
A corpus of short answers written by learners of English and graded with CEFR levels
☆12Updated 3 years ago
Alternatives and similar repositories for cefr-asag-corpus
Users that are interested in cefr-asag-corpus are comparing it to the libraries listed below
Sorting:
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- A neural word aligner based on multilingual BERT☆355Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆36Updated 5 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆375Updated last year
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆27Updated 3 months ago
- Multilingual sentence alignment using sentence embeddings☆122Updated 9 months ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated 3 weeks ago
- A collection of text simplification datasets and other resources☆46Updated 11 months ago
- Improved Sentence Alignment in Linear Time and Space☆180Updated 2 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- Natural Language Processing Research in North American Linguistics Departments☆21Updated 5 months ago
- Python Multilingual Ucrel Semantic Analysis System☆31Updated last year
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆161Updated last year
- https://sites.google.com/site/multidimensionaltagger☆36Updated last year
- Efficient Low-Memory Aligner☆146Updated 7 months ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆362Updated 2 years ago
- ☆64Updated last week
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated 7 months ago
- Natural language understanding benchmarks for Norwegian☆14Updated this week
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated last month
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Repository for CEFR-SP corpus and sentence level assessment☆49Updated 11 months ago
- A simple toolkit for conducting analyses using corpus methods☆26Updated 3 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆183Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Automated Semantic Analysis of Discourse Markers☆10Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆156Updated 3 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆252Updated 2 years ago