mattbierner / urban-dictionary-entry-collector
Script used to collect entry data from Urban Dictionary
☆33Updated 9 years ago
Alternatives and similar repositories for urban-dictionary-entry-collector:
Users that are interested in urban-dictionary-entry-collector are comparing it to the libraries listed below
- ☆97Updated 3 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆65Updated 10 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- The Community-enRiched Open WordNet (CROWN)☆18Updated 9 years ago
- Fast Word Clustering Software☆78Updated last month
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 5 months ago
- Turbo topics find significant multiword phrases in topics.☆46Updated 9 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆126Updated 3 months ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- Tools to work with the big reddit JSON data dump.☆252Updated 8 months ago
- Tools to manipulate and extract data from wikipedia dumps☆46Updated 11 years ago
- ☆21Updated 6 years ago
- A tool for calculation semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆27Updated 9 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- Machine translation for the real world☆23Updated 5 years ago
- Natural language generation language☆56Updated 5 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Updated 5 years ago
- topic model visualization☆52Updated 10 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆45Updated 7 years ago
- Extraction of the five journalistic W-questions (5W) from news articles☆19Updated 6 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆141Updated 6 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed…☆148Updated 2 months ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction an multilingual term alignment☆38Updated 7 years ago