harshnative / words-dataset
over 6_00_000 english words data set arranged with each words frequency
☆17Updated 3 years ago
Alternatives and similar repositories for words-dataset
Users that are interested in words-dataset are comparing it to the libraries listed below
Sorting:
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.☆55Updated 5 months ago
- Split {Japanese, English} text into sentences.☆125Updated last year
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆68Updated last year
- 🏆 • 5050 most frequent words in 109 languages☆42Updated 2 years ago
- Lightweight string similarity function for javascript☆100Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 7 months ago
- Unofficial Python API for NAVER Papago TTS☆32Updated 2 years ago
- Offline database of synonyms/thesaurus☆195Updated last year
- Gather modern English word frequencies from all enwiki articles.☆212Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆246Updated 2 years ago
- Node module wrapper for WordNet dictionary.☆54Updated 3 years ago
- A modern, interlingual wordnet interface for Python☆243Updated this week
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- A simple phonetic respelling for the English language☆10Updated last month
- CLDR text segmentation for JavaScript☆38Updated last year
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆54Updated 8 years ago
- a CSV of every english word, part of speech, and definition. as well as a web scraping script that generates that data for you☆115Updated 2 years ago
- Spelling corrector in python☆482Updated 4 months ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆72Updated 5 months ago
- A list of vocabulary lists☆21Updated 4 years ago
- Downloadable database of german verbs and conjugations as found on wiktionary.org☆27Updated 2 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 3 years ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆17Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆54Updated 10 years ago
- Verb forms dictionary☆66Updated 7 years ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆45Updated 2 years ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- ☆56Updated 2 years ago
- Spanish-English-Spanish XML dictionary☆44Updated 10 months ago
- A list of the most popular English words.☆373Updated 2 years ago