zfz / twitter_corpus
The twitter sentiment corpus created by Sanders Analytics, it consists of 5513 hand-classified tweets(however, 400 tweets missing due to the scripts created by the company). Each tweet was classified with respect to one of four different topics. And a twitter account password hash file is included as well.
☆57Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for twitter_corpus
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆56Updated last year
- Key information extraction from text and graph visualization☆91Updated 4 years ago
- This is Yunshu's [Activision](https://www.activision.com/) internship project. We are interested in understanding user opinions about Act…☆55Updated 5 years ago
- A python program that implements Aspect Based Sentiment Analysis classification system for SemEval 2016 Dataset.☆64Updated 7 years ago
- This repo contains code to detect sarcasm from text in discussion forum using deep learning☆86Updated last year
- ☆37Updated 8 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆107Updated last year
- Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019☆29Updated 5 years ago
- ☆53Updated 2 years ago
- Harry Potter and the Allocation of Dirichlet☆123Updated 5 years ago
- Applying NLP transfer learning techniques to predict Tweet stance toward a topic☆107Updated 5 years ago
- Multi Text Classificaiton☆92Updated 5 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 4 years ago
- Datasets for fake news and misinformation detection☆63Updated last year
- 🔤 Calculate average word embeddings (word2vec) from documents for transfer learning☆54Updated 6 months ago
- Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.☆76Updated 6 years ago
- Hierarchical, multi-label topic modelling with LDA☆53Updated last year
- Twitter word embeddings generated using Word2Vec and FastText.☆49Updated 5 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆82Updated 4 months ago
- ☆226Updated 7 years ago
- Aspect Based Sentiment Analysis is a special type of sentiment analysis. In an explicit aspect, opinion is expressed on a target(opinion …☆74Updated 4 years ago
- Segmentation based event detection from Tweets. Published at NAACL SRW 2019☆60Updated 3 months ago
- Training Temporal Word Embeddings with a Compass☆64Updated last year
- Sentiment analysis with SentiWordNet 3.0☆44Updated 8 years ago
- ☆83Updated 3 years ago
- Cleans Reddit Text Data☆81Updated 4 years ago
- This repo contains code and dataset for the Opinosis Summarization Framework☆50Updated 5 years ago
- A contextual approach for detecting hate speech code words☆9Updated 4 years ago
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 7 years ago