FinNLP / humannames
📦 A huge list (~200K) of human male/female first/last names.
☆41 Updated last year
Related projects
Alternatives and complementary repositories for humannames
- Python package for Wikimedia dump processing (Wiktionary, Wikipedia, etc.). Wikitext parsing, template expansion, Lua module execution. Fo… ☆94 Updated this week
- Machine-readable lists of lemma-token pairs in 23 languages. ☆333 Updated 2 years ago
- Script and sample dataset of all Urban Dictionary entry names (around 1.4 million total) ☆83 Updated 2 years ago
- Lists of most-frequently-used English words / nouns / verbs etc. ☆48 Updated 4 years ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆29 Updated 2 months ago
- Probably the most advanced command-line English dictionary ever. ☆38 Updated 4 years ago
- International Phonetic Alphabet RESTful API for words ☆43 Updated 5 years ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more. ☆124 Updated 8 months ago
- Customizable machine translation in C++ ☆43 Updated 7 months ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser ☆51 Updated 8 years ago
- Generates free or fixed verse poetry from any text corpus using Ngram natural language generator (Markov chains) + POS tagging + rhyme id… ☆28 Updated 10 years ago
- Offline database of synonyms/thesaurus ☆189 Updated 9 months ago
- All languages stopwords collection ☆423 Updated 10 months ago
- Varied English texts for modern NLP testing ☆73 Updated 2 years ago
- Gutenberg cache and query library ☆36 Updated 3 months ago
- A simple bot framework for commenting in subreddits. ☆13 Updated 7 years ago
- Browser version of Hyphe (WIP) ☆29 Updated last month
- List of all English Words ☆23 Updated last year
- Tag news stories based on models trained on the NYT corpus. ☆40 Updated last year
- Generate rules from lists of words ☆16 Updated 3 years ago
- Data for the International Phonetic Alphabet (IPA) ☆26 Updated last year
- Scraper for downloading the entire ebooks repository of Project Gutenberg ☆135 Updated 3 weeks ago
- A Scrapy spider to extract post, thread, and user information from a vBulletin forum to a MongoDB database. ☆31 Updated 8 years ago
- Scraping comments from YouTube. ☆24 Updated 2 years ago
- Wombat.js client-side rewriting library ☆84 Updated last week
- The Data Format for Digital Linguistics (DaFoDiL) ☆22 Updated last year
- Interactive visualization of Wiktionary words and etymologies. ☆90 Updated this week
- 🎀 JavaScript API for spaCy with Python REST API ☆193 Updated last year
- WordNet in JSON format. ☆91 Updated 4 years ago
- English Part-of-speech (POS) tagger ☆65 Updated last year