anaistack / cefr-asag-corpus
A corpus of short answers written by learners of English and graded with CEFR levels
☆12 Updated 3 years ago
Alternatives and similar repositories for cefr-asag-corpus
Users who are interested in cefr-asag-corpus are comparing it to the libraries listed below.
- Multilingual sentence alignment using sentence embeddings ☆130 Updated last year
- Annotation Tool for Text Simplification Corpora ☆17 Updated 2 years ago
- A neural word aligner based on multilingual BERT ☆359 Updated 3 years ago
- Improved Sentence Alignment in Linear Time and Space ☆185 Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit ☆112 Updated 2 weeks ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆38 Updated 2 weeks ago
- This packages up data for the Open Multilingual Wordnet ☆56 Updated 5 months ago
- A collection of text simplification datasets and other resources ☆50 Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆382 Updated 2 years ago
- The University of Pittsburgh English Language Institute Corpus (PELIC) dataset ☆24 Updated 2 years ago
- Natural Language Processing Research in North American Linguistics Departments ☆20 Updated 2 weeks ago
- Repository for CEFR-SP corpus and sentence level assessment ☆53 Updated last year
- Natural language understanding benchmarks for Norwegian ☆14 Updated 3 months ago
- Utility for behavioral and representational analyses of Language Models ☆171 Updated 2 months ago
- ☆32 Updated 2 years ago
- ☆65 Updated 3 months ago
- A modern, interlingual wordnet interface for Python ☆274 Updated this week
- Open Language Profiles: English profile datasets from CEFR-J ☆154 Updated 5 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts ☆10 Updated 3 years ago
- Efficient Low-Memory Aligner ☆146 Updated 10 months ago
- cLang-8 is a dataset for grammatical error correction. ☆110 Updated 3 years ago
- Automated Semantic Analysis of Discourse Markers ☆10 Updated 3 years ago
- ☆78 Updated 3 months ago
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se… ☆29 Updated 6 months ago
- A module to compute textual lexical richness (aka lexical diversity). ☆110 Updated 2 years ago
- An accurate multilingual word aligner based on LaBSE ☆24 Updated 2 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆203 Updated last year
- FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning. Presented at EACL 2023. ☆37 Updated 2 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard. ☆25 Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆74 Updated 7 months ago