openderocknlp / vocabulary-level-grader
Analyzes the given text and determines its vocabulary level based on CEFR levels
☆48 Updated 2 years ago
Alternatives and similar repositories for vocabulary-level-grader
Users who are interested in vocabulary-level-grader are comparing it to the libraries listed below
- Open Language Profiles — English profile datasets from CEFR-J ☆154 Updated 5 years ago
- A list of vocabulary lists ☆22 Updated 5 years ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆33 Updated last year
- Gather modern English word frequencies from all enwiki articles. ☆227 Updated last year
- Multilingual sentence alignment using sentence embeddings ☆130 Updated last year
- Download the pronunciation mp3 audio for 119,376 unique English words/terms ☆218 Updated 6 years ago
- Converts English text to IPA notation ☆392 Updated 2 years ago
- Tokenizes Chinese texts into words. ☆100 Updated 2 years ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code ☆49 Updated 9 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆52 Updated 2 years ago
- A corpus of short answers written by learners of English and graded with CEFR levels ☆12 Updated 3 years ago
- British English pronunciation dictionary ☆96 Updated 8 years ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆350 Updated 3 years ago
- Monolingual wordlists with pronunciation information in IPA ☆697 Updated 6 months ago
- A Python Wiktionary Parser ☆367 Updated 4 months ago
- A pronunciation trainer w/ Python. ☆14 Updated 2 months ago
- 📈 A forced aligner intended for synchronization of narrated text ☆100 Updated 3 months ago
- Massively multilingual pronunciation mining ☆357 Updated 3 months ago
- Text to IPA converter in JavaScript ☆58 Updated 3 years ago
- Lexical database for ~70k English words with morphological variables ☆47 Updated 3 years ago
- ☆16 Updated 2 years ago
- Sentence aligner ☆121 Updated 4 years ago
- CLDR text segmentation for JavaScript ☆38 Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆179 Updated 5 months ago
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org. ☆76 Updated 6 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆255 Updated 3 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where … ☆12 Updated last year
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆24 Updated 8 years ago
- Machine-Translation-based sentence alignment tool for parallel text ☆313 Updated 4 years ago
- Translation Memory Open-source Purifier ☆35 Updated 3 years ago