harshnative / words-datasetLinks
over 6_00_000 english words data set arranged with each words frequency
ā17Updated 3 years ago
Alternatives and similar repositories for words-dataset
Users that are interested in words-dataset are comparing it to the libraries listed below
Sorting:
- š ⢠5050 most frequent words in 109 languagesā42Updated 2 years ago
- A parallel corpus of Sorani, Kurmanji and Englishā13Updated 4 years ago
- English Lemma Database - Compiled by Referencing British National Corpusā31Updated 8 months ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languagesā50Updated last week
- A list of vocabulary listsā21Updated 4 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where ā¦ā12Updated last year
- A list of awesome Machine Translation frameworks, libraries, software and papersā192Updated 10 months ago
- An NLP pipeline for Hebrewā38Updated last week
- š A forced aligner intended for synchronization of narrated textā93Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyā160Updated this week
- RosaeNLG is a Natural Language Generation library for node.js and browser rendering, based on the Pug template engine.ā99Updated 5 months ago
- 3000+ machine-readable open source dictionaries distributed by the Applied Computational Linguistics lab at the University of Augsburg, Gā¦ā12Updated last year
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.ā107Updated 2 weeks ago
- š Additional lookup tables and data resources for spaCyā105Updated last week
- Verb forms dictionaryā66Updated 7 years ago
- A library for fetching and reading Tatoeba's weekly exportsā23Updated last year
- A repository of words in multiple languages sorted by their frequencyā11Updated last year
- Offline bilingual dictionaries made using data from Wiktionaryā55Updated 10 years ago
- Convert number words (eg. twenty one) to numeric digits (21)ā176Updated last year
- A Python library for detecting and filtering profanityā161Updated 4 years ago
- JavaScript port of SymSpell for Node.jsā13Updated 2 years ago
- Probably the most advanced command-line english dictionary ever.ā38Updated 5 years ago
- Node module wrapper for WordNet dictionary.ā54Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.ā249Updated 2 years ago
- š¦ JavaScript library to return the synonyms of the word ~ 27779 wordsā67Updated 2 years ago
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.ā56Updated 6 months ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python codeā35Updated 4 months ago
- Tool to generate paraphrases of sentences in many languages.ā84Updated 3 years ago
- Repo for the Unified Verbs Index Projectā11Updated 2 months ago
- Accurately find/replace/remove emojis in text stringsā162Updated last year