mattbierner / urban-dictionary-entry-collector
Script used to collect entry data from Urban Dictionary
☆33Updated 8 years ago
Alternatives and similar repositories for urban-dictionary-entry-collector:
Users that are interested in urban-dictionary-entry-collector are comparing it to the libraries listed below
- ☆97Updated 3 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆44Updated 7 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- WordNet in JSON format.☆90Updated 4 years ago
- Python library for reading and writing warc files☆239Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Serving content from a WARC☆61Updated 12 years ago
- A Utility Library for Wikipedia dumps☆33Updated 7 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆253Updated 4 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- AMALGrAM, an English supersense tagger written in Python☆33Updated 7 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 3 months ago
- This repository contains tool and collections dataset for detecting off-topic pages from Web archived collections.☆18Updated 9 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Common web archive utility code.☆53Updated 2 months ago
- A Deep NN used to generate stories which will tingle your butt.☆39Updated 3 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆86Updated 7 years ago
- Socially-Equitable Language Identification☆78Updated last year
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)☆33Updated 3 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆125Updated 2 months ago
- A parser and autocorrection tool for wiktionary.☆39Updated 9 years ago
- Fast Word Clustering Software☆78Updated 2 weeks ago
- A tool for calculation semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆28Updated 9 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 8 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Updated 5 years ago
- Uses a distributed word representation to finds words along the hyperchord of two input words.☆102Updated 4 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago