openderock / vocabulary-level-grader
Analyzes the given text and determine what's the vocabulary level based on CEFR levels
☆44Updated 2 years ago
Alternatives and similar repositories for vocabulary-level-grader:
Users that are interested in vocabulary-level-grader are comparing it to the libraries listed below
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.☆45Updated last month
- Open Language Profiles — English profile datasets from CEFR-J☆112Updated 4 years ago
- A list of vocabulary lists☆21Updated 4 years ago
- Exploring the idea of a generic, language agnostic, CEFR level classifier☆21Updated 6 years ago
- Multilingual sentence alignment using sentence embeddings☆106Updated 2 months ago
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 3 months ago
- A tool to find grammar patterns in Chinese text☆24Updated 5 years ago
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- NLP to classify a text's lexile level☆31Updated last month
- Sentence aligner☆109Updated 3 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆37Updated last year
- Tokenizes Chinese texts into words.☆96Updated 2 years ago
- CLDR text segmentation for JavaScript☆38Updated 8 months ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆23Updated 7 years ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆23Updated 4 years ago
- A pronunciation trainer w/ Python.☆12Updated last year
- Gather modern English word frequencies from all enwiki articles.☆207Updated 10 months ago
- Repository for CEFR-SP corpus and sentence level assessment☆34Updated 4 months ago
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.☆66Updated 3 years ago
- Converts English text to IPA notation☆371Updated last year
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆25Updated 2 years ago
- A corpus of short answers written by learners of English and graded with CEFR levels☆10Updated 3 years ago
- 📈 A forced aligner intended for synchronization of narrated text☆87Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆111Updated 4 months ago
- 開放漢語字典 - 現代漢語字音數據庫☆21Updated 4 years ago
- api to retrieve word definitions and other info☆61Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆234Updated 2 years ago
- 粵文語料篩選器 Cantonese text filter☆36Updated last month
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆75Updated 3 years ago
- Monolingual wordlists with pronunciation information in IPA☆574Updated last year