FinNLP / humannamesLinks
📦 A list, huge one (~200K) of human male/female first/last names.
☆56Updated 2 years ago
Alternatives and similar repositories for humannames
Users that are interested in humannames are comparing it to the libraries listed below
Sorting:
- Scraper for downloading the entire ebooks repository of project Gutenberg☆155Updated last week
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Updated 2 years ago
- Jason Riggle's chart of phonological features in JSON format + extras☆54Updated last year
- An off-the-shelf client-side language identification module for JavaScript.☆16Updated 11 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆358Updated 4 years ago
- pure javascript lstm rnn implementation based on ocropus☆38Updated 11 years ago
- Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more☆20Updated 7 years ago
- Offline database of synonyms/thesaurus☆208Updated 2 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated this week
- All the words from Google Books, sorted by frequency☆127Updated 2 years ago
- Data for the International Phonetic Alphabet (IPA)☆33Updated 3 years ago
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.☆24Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py☆60Updated last year
- English Resource Grammar☆24Updated 3 months ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆21Updated 4 years ago
- Trigram files for 500+ languages☆25Updated 10 months ago
- A transcription text editor with respeak module☆14Updated 2 weeks ago
- ☆86Updated 7 months ago
- A dataset of popular forenames and surnames by country☆55Updated 2 years ago
- A polite and user-friendly downloader for Common Crawl data☆67Updated 5 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆58Updated 4 years ago
- A collection of tools for archiving and analysing the internet.☆78Updated 3 years ago
- SCOWL (and friends).☆462Updated last week
- Gramadán: a computational grammar of Irish☆17Updated 3 years ago
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database☆26Updated last year
- an experimental implementation of Burrow's delta in Python 3☆21Updated 4 years ago
- Korpuslinguistik war noch nie so einfach...☆24Updated 7 months ago
- Tools to construct and process Common Crawl webgraphs☆105Updated last week
- Generate information about text including syllable counts and Flesch-Kincaid, Gunning-Fog, Coleman-Liau, SMOG and Automated Readability s…☆194Updated 9 years ago
- Code to compute the Dropbox API's "content_hash"☆73Updated 3 years ago