emory-courses / data-scienceLinks
Practical Approaches to Data Science with Text
☆25Updated 6 years ago
Alternatives and similar repositories for data-science
Users that are interested in data-science are comparing it to the libraries listed below
Sorting:
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆316Updated 3 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- CrisisLex: Your data and lexical resource in crises☆51Updated last year
- Natural Language Processing.☆59Updated 4 years ago
- Practical Approaches to Data Science with Text☆39Updated 6 years ago
- Tutorial on computational models of language change☆116Updated 6 years ago
- Training Temporal Word Embeddings with a Compass☆65Updated 4 months ago
- This is an implementation of Hearst patterns, for finding hyponyms, written in Python.☆87Updated 3 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Updated 6 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆16Updated 5 years ago
- A set of media framing annotations, along with scripts for obtaining the corresponding news articles☆54Updated 6 years ago
- ☆54Updated 4 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 7 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29Updated 5 years ago
- See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse☆150Updated 5 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆62Updated last year
- a python package for cleaning Gutenberg books and dataset☆34Updated 7 months ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 3 years ago
- ☆11Updated 5 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- A temporal ordering system for events and time expressions in written text.☆42Updated 3 years ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- Collection of tools for building diachronic/historical word vectors☆443Updated 2 years ago
- Sentiment Lexicon Generation Suite☆15Updated 8 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 6 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases☆45Updated 5 years ago
- Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)☆67Updated 8 years ago
- annotated hateful speech☆24Updated 6 years ago
- MiTextExplorer - interactive browser of text and document covariates.☆24Updated 10 years ago