maxent-ai / DatasetsLinks
datasets with text data for use in NLP, Text analysis, information extraction, ML research.
☆16Updated 6 years ago
Alternatives and similar repositories for Datasets
Users that are interested in Datasets are comparing it to the libraries listed below
Sorting:
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Updated 6 years ago
- Political Discourse Analysis Using Pre-Trained Word Vectors.☆23Updated 2 years ago
- This is Yunshu's [Activision](https://www.activision.com/) internship project. We are interested in understanding user opinions about Act…☆57Updated 6 years ago
- The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply u…☆49Updated 8 years ago
- ☆43Updated 10 years ago
- Detecting Sarcasm on Twitter using both traditonal machine learning and deep learning techniques.☆100Updated 7 years ago
- Build a deep learning model for predicting the named entities from text.☆55Updated 7 years ago
- [AAAI SAP 2020] Modeling Personality with Attentive Networks and Contextual Embeddings☆60Updated 3 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated last year
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.☆46Updated 5 years ago
- ☆32Updated 7 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- ☆16Updated 2 years ago
- Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team☆27Updated 4 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- ☆40Updated 10 years ago
- Transfer Learning for NLP Tasks☆55Updated 7 years ago
- Social Media Mining Toolkit (SMMT) main repository☆137Updated 3 years ago
- 🏖TagEditor - Annotation tool for spaCy☆193Updated 3 years ago
- Dataset and scripts for scraping the news articles from popular sources along with the summary of the article.☆49Updated 6 years ago
- Twitter Trends is a web-based application that automatically detects and analyzes emerging topics in real time through hashtags and user …☆110Updated 8 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 7 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 3 years ago
- materials for the study on mental health subreddits. If you use this code in your work, please cite George Gkotsis, Anika Oellrich, Tim …☆22Updated last year
- BERT semantic search engine for searching literature research papers for coronavirus covid-19 in google colab☆31Updated 5 years ago
- Using NLP to cluster reddit user comments by topics☆14Updated 8 years ago
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 8 years ago
- A traits predictor using Python☆15Updated 7 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Final Group ProjeDetecting Human Emotions Using Natural Language Processing UCB Data Analytics☆21Updated last year