FinNLP / humannamesLinks
π¦ A list, huge one (~200K) of human male/female first/last names.
β54Updated last year
Alternatives and similar repositories for humannames
Users that are interested in humannames are comparing it to the libraries listed below
Sorting:
- Machine-readable lists of lemma-token pairs in 23 languages.β342Updated 3 years ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionaryβ79Updated 4 years ago
- an opinionated assembly of wordnet for javascriptβ55Updated 8 years ago
- Jason Riggle's chart of phonological features in JSON format + extrasβ54Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.β14Updated 2 years ago
- Offline database of synonyms/thesaurusβ202Updated last year
- An advanced, extensible web front-end for the Manatee-open corpus search engineβ73Updated last week
- Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and moreβ20Updated 6 years ago
- pure javascript lstm rnn implementation based on ocropusβ39Updated 10 years ago
- wordnik python3 libraryβ79Updated last year
- English lemmatizerβ67Updated 2 years ago
- The largest English-language thesaurusβ303Updated this week
- Stylometric Data Mining Library with a focus on identifying Satoshi Nakamoto as a case study.β28Updated last year
- Audio (and video) player for oTranscribeβ27Updated 9 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β40Updated last year
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the datβ¦β158Updated 8 months ago
- Training scripts for Argos Translateβ141Updated 2 weeks ago
- All the words from Google Books, sorted by frequencyβ118Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.β224Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.β53Updated 4 years ago
- This project represents the 300-dimensional word vectors from word2vec as JSON.β128Updated 8 years ago
- GramadΓ‘n: a computational grammar of Irishβ15Updated 2 years ago
- Wikipedia Bilingual Reference Data (English)β16Updated 9 years ago
- The Unicode Cookbook for Linguistsβ56Updated 4 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.β34Updated 2 years ago
- Automatically exported from code.google.com/p/guess-languageβ52Updated last year
- WordNet in JSON format.β93Updated 5 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).β64Updated last week
- Interactive visualization of Wiktionary words and etymologies.β94Updated last month
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Updated last year