openderocknlp / vocabulary-level-grader
Analyzes the given text and determines its vocabulary level based on CEFR levels
☆45 Updated 2 years ago
Alternatives and similar repositories for vocabulary-level-grader:
Users who are interested in vocabulary-level-grader are comparing it to the libraries listed below:
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level. ☆52 Updated 4 months ago
- Open Language Profiles — English profile datasets from CEFR-J ☆122 Updated 5 years ago
- A list of vocabulary lists ☆21 Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings ☆116 Updated 5 months ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆30 Updated 7 months ago
- Gather modern English word frequencies from all enwiki articles. ☆212 Updated last year
- NLP to classify a text's lexile level ☆35 Updated 4 months ago
- British English pronunciation dictionary ☆95 Updated 7 years ago
- 📈 A forced aligner intended for synchronization of narrated text ☆91 Updated 2 years ago
- Repository for CEFR-SP corpus and sentence level assessment ☆40 Updated 7 months ago
- A modern, interlingual wordnet interface for Python ☆243 Updated last week
- JavaScript implementation of an *.mdx/*.mdd interpreter, with support for mdict index files ☆168 Updated last month
- Translation Memory Open-source Purifier ☆34 Updated 2 years ago
- Converts English text to IPA notation ☆381 Updated last year
- CLDR text segmentation for JavaScript ☆38 Updated 11 months ago
- Tokenizes Chinese texts into words. ☆96 Updated 2 years ago
- Download the pronunciation mp3 audio for 119,376 unique English words/terms ☆198 Updated 6 years ago
- Exploring the idea of a generic, language agnostic, CEFR level classifier ☆22 Updated 7 years ago
- Machine-Translation-based sentence alignment tool for parallel text ☆309 Updated 4 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser ☆54 Updated 8 years ago
- A Python Wiktionary Parser ☆358 Updated 2 months ago
- A corpus of short answers written by learners of English and graded with CEFR levels ☆11 Updated 3 years ago
- Sentence aligner ☆112 Updated 3 years ago
- Offline database of synonyms/thesaurus ☆195 Updated last year
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code ☆33 Updated 2 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆30 Updated 3 years ago
- [public][generated-english-phrasal-verbs] ☆48 Updated 7 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆244 Updated 2 years ago
- Monolingual wordlists with pronunciation information in IPA ☆610 Updated last month
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word. ☆66 Updated 3 years ago