mattbierner / urban-dictionary-entry-collector
Script used to collect entry data from Urban Dictionary
☆33 Updated 9 years ago
Alternatives and similar repositories for urban-dictionary-entry-collector
Users who are interested in urban-dictionary-entry-collector are comparing it to the libraries listed below.
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram… ☆254 Updated 4 years ago
- ☆97 Updated 4 years ago
- WordNet in JSON format. ☆91 Updated 4 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆116 Updated 9 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian. ☆154 Updated 9 months ago
- Semanticizest: dump parser and client ☆20 Updated 9 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed… ☆152 Updated 2 weeks ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆315 Updated 3 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump ☆97 Updated 9 years ago
- A simple interface to the Project Gutenberg corpus. ☆330 Updated 2 years ago
- An open source toolkit for mining Wikipedia ☆130 Updated 6 years ago
- Wiktionary parser tool for many language editions. ☆54 Updated 2 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 Updated 7 years ago
- Language Tool style grammar handling with spaCy 2.0 ☆42 Updated 7 years ago
- Socially-Equitable Language Identification ☆78 Updated 2 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc… ☆40 Updated 8 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby ☆17 Updated 3 years ago
- ☆57 Updated 10 years ago
- A simple configurable tool for manipulating dependency trees. ☆14 Updated 7 months ago
- AMALGrAM, an English supersense tagger written in Python ☆33 Updated 8 years ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University. ☆353 Updated 2 years ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/ ☆195 Updated 6 years ago
- a collection of functions that measure the readability of a given body of text ☆195 Updated 7 years ago
- Tools to work with the big reddit JSON data dump. ☆255 Updated last year
- Machine translation for the real world ☆23 Updated 5 years ago
- Various utilities for processing the data. ☆212 Updated this week
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆49 Updated 4 months ago
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated 2 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs ☆28 Updated 10 years ago
- A multilingual parallel corpus created from translations of the Bible. ☆183 Updated 2 months ago