harshnative / words-dataset
over 6_00_000 english words data set arranged with each words frequency
☆13Updated 3 years ago
Alternatives and similar repositories for words-dataset:
Users that are interested in words-dataset are comparing it to the libraries listed below
- A Python library for detecting and filtering profanity☆162Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆238Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated 3 months ago
- 📂 Additional lookup tables and data resources for spaCy☆100Updated 3 weeks ago
- Tool to generate paraphrases of sentences in many languages.☆83Updated 3 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆10Updated 9 months ago
- Transliteration for languages and dialects☆42Updated 2 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆123Updated 8 months ago
- A list of vocabulary lists☆21Updated 4 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆73Updated 2 months ago
- English Lemma Database - Compiled by Referencing British National Corpus☆29Updated 4 months ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆71Updated 2 months ago
- ☆55Updated last year
- Verb forms dictionary☆63Updated 7 years ago
- A universal Python library for detecting and filtering profanity☆76Updated 2 months ago
- Parse numbers written in natural language☆109Updated 3 months ago
- Text2Text Language Modeling Toolkit☆298Updated last month
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languages☆47Updated 3 months ago
- A parallel corpus of Sorani, Kurmanji and English☆10Updated 4 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Updated 10 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last month
- ☆84Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity