aakashkag / People-Name-List
☆51Updated 10 months ago
Alternatives and similar repositories for People-Name-List:
Users that are interested in People-Name-List are comparing it to the libraries listed below
- Text databases of last names from various countries☆280Updated 2 years ago
- A comprehensive database of name variants☆46Updated 2 years ago
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi…☆36Updated 2 weeks ago
- Lightning Fast Language Prediction 🚀☆166Updated 6 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 5 months ago
- Download all plaintext scripts from imsdb.com☆34Updated last year
- Lexical database of any language☆179Updated 2 years ago
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Updated 6 years ago
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations☆15Updated 8 years ago
- roll a wikipedia dump into mongo☆243Updated 9 months ago
- Transliteration data and models☆56Updated 8 years ago
- Source real estate prices from the Common Crawl.☆27Updated 6 years ago
- WordNet in JSON format.☆92Updated 4 years ago
- Parse a text corpus and generate sentences in the same style using context-free grammar combined with a Markov chain.☆34Updated 6 years ago
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.☆33Updated 7 years ago
- Python 3 library for reading and writing warc files☆20Updated 7 years ago
- Reference datasets on historic and current names in the US☆46Updated 10 years ago
- ☆79Updated last year
- A Deep NN used to generate stories which will tingle your butt.☆39Updated 4 years ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆64Updated last year
- Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.☆133Updated 6 years ago
- a parser for Forsyth-Edwards Notation (for encoding chess positions)☆16Updated 9 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- Storage and retrieval of Word Embeddings in various databases☆51Updated 6 years ago
- Package for performing Reddit-based text analysis☆21Updated 6 years ago
- Finds the Jaro Winkler Distance indicating a distance or similarity score between two strings.☆26Updated 2 months ago
- SCOWL (and friends).☆419Updated 2 weeks ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆253Updated last year
- Sentiment Analysis applied to different datasets such as IMDB☆19Updated 9 years ago