harshnative / words-datasetLinks
over 6_00_000 english words data set arranged with each words frequency
☆21Updated 4 years ago
Alternatives and similar repositories for words-dataset
Users that are interested in words-dataset are comparing it to the libraries listed below
Sorting:
- A list of vocabulary lists☆22Updated 5 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆32Updated last year
- Gather modern English word frequencies from all enwiki articles.☆225Updated last year
- Transliteration for languages and dialects☆43Updated 3 years ago
- A Python library for detecting and filtering profanity☆166Updated 4 years ago
- Offline database of synonyms/thesaurus☆202Updated last year
- Converts English text to IPA notation☆390Updated 2 years ago
- A collection of fun and interesting words in English used in the Insanity Jam's Game Idea Generator☆13Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 3 months ago
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.☆69Updated 10 months ago
- Javascript libraries to process text: Arabic, Japanese, etc.☆51Updated last year
- Split {Japanese, English} text into sentences.☆134Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆176Updated 4 months ago
- A learning JavaScript dictionary-based word prediction / autocomplete / suggestion library.☆40Updated 2 years ago
- A simple phonetic respelling for the English language☆10Updated 2 weeks ago
- British English pronunciation dictionary☆95Updated 7 years ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆47Updated 2 years ago
- All the words from Google Books, sorted by frequency☆118Updated 2 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆43Updated 8 months ago
- JavaScript port of SymSpell for Node.js☆13Updated 3 years ago
- Aksharamukha Python Library☆52Updated 8 months ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 3 years ago
- A list of the most popular English words.☆383Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 10 months ago
- An NLP pipeline for Hebrew☆39Updated 3 months ago
- Text to IPA converter in JavaScript☆58Updated 3 years ago
- a CSV of every english word, part of speech, and definition. as well as a web scraping script that generates that data for you☆129Updated 2 years ago
- Monolingual wordlists with pronunciation information in IPA☆676Updated 4 months ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆343Updated 3 years ago