anaistack / cefr-asag-corpusLinks
A corpus of short answers written by learners of English and graded with CEFR levels
☆12Updated 3 years ago
Alternatives and similar repositories for cefr-asag-corpus
Users that are interested in cefr-asag-corpus are comparing it to the libraries listed below
Sorting:
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- Improved Sentence Alignment in Linear Time and Space☆180Updated 2 years ago
- A neural word aligner based on multilingual BERT☆354Updated 3 years ago
- Multilingual sentence alignment using sentence embeddings☆121Updated 9 months ago
- Repository for CEFR-SP corpus and sentence level assessment☆47Updated 10 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆36Updated 5 months ago
- Natural Language Processing Research in North American Linguistics Departments☆21Updated 4 months ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated this week
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆373Updated last year
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆28Updated 2 months ago
- Natural language understanding benchmarks for Norwegian☆14Updated last year
- A collection of text simplification datasets and other resources☆46Updated 11 months ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- Parallel corpora for the biomedical domain☆50Updated last year
- https://sites.google.com/site/multidimensionaltagger☆36Updated last year
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆38Updated 9 months ago
- A simple toolkit for conducting analyses using corpus methods☆26Updated 3 years ago
- Utility for behavioral and representational analyses of Language Models☆157Updated this week
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆182Updated last year
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆359Updated 2 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆119Updated 4 months ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Split bib files for anthology bibliography for overleaf☆11Updated 11 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- cLang-8 is a dataset for grammatical error correction.☆107Updated 3 years ago
- a tool for calcualting character n-gram F score☆73Updated 2 years ago
- Open Language Profiles — English profile datasets from CEFR-J☆144Updated 5 years ago
- ☆167Updated last year
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- Sentence aligner☆116Updated 4 years ago