FinNLP / humannamesLinks
π¦ A list, huge one (~200K) of human male/female first/last names.
β52Updated last year
Alternatives and similar repositories for humannames
Users that are interested in humannames are comparing it to the libraries listed below
Sorting:
- Machine-readable lists of lemma-token pairs in 23 languages.β341Updated 3 years ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionaryβ79Updated 3 years ago
- All the words from Google Books, sorted by frequencyβ117Updated 2 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.β13Updated 2 years ago
- Lists of most-frequently-used english words / nouns / verbs etc.β72Updated 5 years ago
- A list of the most popular English words.β381Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.β52Updated 4 years ago
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.β24Updated last year
- Faster, modernized fork of the language identification tool langid.pyβ56Updated 7 months ago
- International Phonetic Alphabet RESTful API for wordsβ49Updated 5 years ago
- roll a wikipedia dump into mongoβ244Updated last year
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the datβ¦β153Updated 6 months ago
- Scraper for downloading the entire ebooks repository of project Gutenbergβ151Updated this week
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (httpβ¦β24Updated 8 years ago
- RosaeNLG is a Natural Language Generation library for node.js and browser rendering, based on the Pug template engine.β100Updated 6 months ago
- List of common stop words in various languages.β337Updated 2 years ago
- Data for the International Phonetic Alphabet (IPA)β28Updated 2 years ago
- an opinionated assembly of wordnet for javascriptβ55Updated 8 years ago
- β100Updated this week
- The Unicode Cookbook for Linguistsβ54Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Foβ¦β104Updated last month
- A Corpus Data Retrieval Index using Lucene for Look-Upsβ17Updated this week
- Offline database of synonyms/thesaurusβ198Updated last year
- An off-the-shelf client-side language identification module for JavaScript.β16Updated 11 years ago
- WordNet-LMF formatsβ22Updated this week
- Pronunciation dictionaries for several languages, based on Wiktionary data.β20Updated 3 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.β46Updated 7 years ago
- Verb forms dictionaryβ66Updated 7 years ago
- Script used to collect entry data from Urban Dictionaryβ33Updated 9 years ago
- Gather modern English word frequencies from all enwiki articles.β218Updated last year