openlanguageprofiles / olp-en-cefrj
Open Language Profiles — English profile datasets from CEFR-J
☆103 Updated 4 years ago
Related projects
Alternatives and complementary repositories for olp-en-cefrj
- Repository for CEFR-SP corpus and sentence level assessment ☆32 Updated 2 months ago
- Analyzes the given text and determines its vocabulary level based on CEFR levels ☆43 Updated last year
- A corpus of short answers written by learners of English and graded with CEFR levels ☆10 Updated 2 years ago
- Multilingual sentence alignment using sentence embeddings ☆101 Updated 2 weeks ago
- Improved Sentence Alignment in Linear Time and Space ☆163 Updated last year
- Converts English text to IPA notation ☆364 Updated last year
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages ☆48 Updated last month
- cLang-8 is a dataset for grammatical error correction. ☆103 Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles. ☆204 Updated 8 months ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries. ☆144 Updated 7 months ago
- Machine-Translation-based sentence alignment tool for parallel text ☆300 Updated 3 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆36 Updated last year
- ☆67 Updated 3 months ago
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level. ☆42 Updated 2 years ago
- NLP to classify a text's lexile level ☆30 Updated 2 years ago
- Sentence aligner ☆108 Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆230 Updated 2 years ago
- British English pronunciation dictionary ☆89 Updated 7 years ago
- BERT-based GEC tagging for Japanese ☆16 Updated last year
- Unidic packaged for installation via pip. ☆79 Updated last year
- Software for phonetic transcription of English and Finnish, and IPA tools ☆15 Updated 8 years ago
- A Python Wiktionary Parser ☆358 Updated 10 months ago
- 🔍 🔎 For English Learners ☆45 Updated last year
- ☆22 Updated last year
- Verb forms dictionary ☆60 Updated 7 years ago
- Common English Vocabulary Word List ☆289 Updated 5 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code ☆50 Updated last year
- Contextualised Word Representations for Lexical Semantic Change Analysis ☆31 Updated 4 years ago
- vocabulary lists ☆79 Updated 5 years ago
- Tokenizes Chinese texts into words. ☆95 Updated last year