openlanguageprofiles / olp-en-cefrjLinks
Open Language Profiles — English profile datasets from CEFR-J
☆155Updated 5 years ago
Alternatives and similar repositories for olp-en-cefrj
Users that are interested in olp-en-cefrj are comparing it to the libraries listed below
Sorting:
- Repository for CEFR-SP corpus and sentence level assessment☆55Updated last year
- Multilingual sentence alignment using sentence embeddings☆131Updated last year
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆48Updated 2 years ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆41Updated last year
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- A corpus of short answers written by learners of English and graded with CEFR levels☆12Updated 3 years ago
- British English pronunciation dictionary☆96Updated 8 years ago
- Converts English text to IPA notation☆393Updated 2 years ago
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆193Updated 5 years ago
- ☆32Updated 2 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆52Updated 2 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆313Updated 4 years ago
- Improved Sentence Alignment in Linear Time and Space☆186Updated 2 years ago
- Sentence aligner☆121Updated 4 years ago
- NLP to classify a text's lexile level☆41Updated 11 months ago
- A modern, interlingual wordnet interface for Python☆276Updated this week
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆29Updated 6 months ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 3 months ago
- A large parallel corpus of English and Japanese☆87Updated 8 years ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆214Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆33Updated last year
- Lexical database for ~70k English words with morphological variables☆48Updated 3 years ago
- The University of Pittsburgh English Language Institute Corpus (PELIC) dataset☆25Updated 2 years ago
- Download the pronunciation mp3 audio for 119,376 unique English words/terms☆219Updated 6 years ago
- Unidic packaged for installation via pip.☆106Updated 9 months ago
- Exploring the idea of a generic, language agnostic, CEFR level classifier☆23Updated 7 years ago
- A Python Wiktionary Parser☆367Updated 4 months ago
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.☆73Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆30Updated 2 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆51Updated 2 years ago