mattbierner / urban-dictionary-entry-collector
Script used to collect entry data from Urban Dictionary
☆33Updated 9 years ago
Alternatives and similar repositories for urban-dictionary-entry-collector:
Users that are interested in urban-dictionary-entry-collector are comparing it to the libraries listed below
- WordNet in JSON format.☆92Updated 4 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- ☆97Updated 3 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 2 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- Java Wiktionary Library☆57Updated 2 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.☆133Updated 6 years ago
- An unsupervised compound splitter☆41Updated 5 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 8 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 10 months ago
- Advanced desktop search/corpus exploration prototype☆21Updated 3 years ago
- Labeled examples from wiki dumps in Python☆67Updated 8 years ago
- Fast Word Clustering Software☆78Updated 2 months ago
- German Morphological Analyzer☆47Updated 3 years ago
- AMALGrAM, an English supersense tagger written in Python☆33Updated 7 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR…☆36Updated 9 years ago
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆95Updated 3 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 6 years ago
- Ukb: graph-based WSD and similarity☆106Updated 11 months ago
- 2016 Presidential Campaign Speeches☆15Updated 8 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed…☆150Updated 2 weeks ago
- Shell scripts to assist downloading & processing the Google n-grams corpora☆14Updated 8 years ago