FinNLP / humannamesLinks
π¦ A list, huge one (~200K) of human male/female first/last names.
β54Updated last year
Alternatives and similar repositories for humannames
Users that are interested in humannames are comparing it to the libraries listed below
Sorting:
- Scraper for downloading the entire ebooks repository of project Gutenbergβ152Updated this week
- Machine-readable lists of lemma-token pairs in 23 languages.β345Updated 3 years ago
- Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and moreβ20Updated 6 years ago
- pure javascript lstm rnn implementation based on ocropusβ39Updated 10 years ago
- Trigram files for 500+ languagesβ24Updated 6 months ago
- β108Updated 2 weeks ago
- An advanced, extensible web front-end for the Manatee-open corpus search engineβ76Updated last month
- The Unicode Cookbook for Linguistsβ56Updated 4 years ago
- Authoring tool for interactive content.β22Updated 2 weeks ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.β14Updated 2 years ago
- An XML parser for lezerβ16Updated 9 months ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the datβ¦β159Updated 9 months ago
- The largest English-language thesaurusβ305Updated 3 weeks ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β50Updated last week
- Customizable machine translation in C++β53Updated last year
- Stylometric Data Mining Library with a focus on identifying Satoshi Nakamoto as a case study.β28Updated last year
- A learning JavaScript dictionary-based word prediction / autocomplete / suggestion library.β40Updated 2 years ago
- RosaeNLG is a Natural Language Generation library for node.js and browser rendering, based on the Pug template engine.β103Updated 9 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ71Updated 6 months ago
- Lists of most-frequently-used english words / nouns / verbs etc.β85Updated 5 years ago
- A collection of tools for archiving and analysing the internet.β78Updated 3 years ago
- Unsupervised text summarization using the lexrank algorithmβ15Updated 3 years ago
- Offline database of synonyms/thesaurusβ202Updated last year
- Gather modern English word frequencies from all enwiki articles.β226Updated last year
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research onlyβ28Updated 5 months ago
- This project represents the 300-dimensional word vectors from word2vec as JSON.β128Updated 8 years ago
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.β24Updated 2 years ago
- Text to IPA converter in JavaScriptβ58Updated 3 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.β130Updated last month
- Search Apps for the Searchlabβ15Updated 3 years ago