jreynolds999 / NLP-Reddit-Classification
Natural Language Processing on text data scraped from the web in order to predict the author's political affiliation through machine learning and pattern analysis. Please see ReadMe for more info!
☆12Updated last year
Related projects: ⓘ
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 2 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆52Updated 2 years ago
- Text analysis with networks.☆283Updated 4 months ago
- A deep learning system for demographic inference (gender, age, and individual/person) that was trained on massive Twitter dataset using p…☆145Updated last year
- Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆40Updated 5 years ago
- Datasets for fake news and misinformation detection☆63Updated last year
- The dataset used to evaluate JobBERT on the task of job title normalization.☆22Updated 2 years ago
- Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classifi…☆51Updated 3 years ago
- A Corpus of 475,000 Industrial Occupations☆63Updated 3 years ago
- Code for the paper "Characterizing and Detecting Hateful Users on Twitter"☆73Updated 3 years ago
- Social Media Mining Toolkit (SMMT) main repository☆130Updated last year
- A Named Entity Recognition system that extracts soft skills from text☆26Updated last month
- Code for the paper "Content Analysis of Textbooks via Natural Language Processing".☆55Updated last year
- Nesta's Skills Extractor Library☆118Updated last month
- https://duyet.github.io/related-skills-visualization/index.html☆11Updated 4 years ago
- Quote extraction for modular journalism (JournalismAI collab 2021)☆225Updated 2 years ago
- ☆22Updated 3 years ago
- multi-labeled dataset of resumes☆69Updated 3 years ago
- Small tutorial on how you can use BERT for Topic Modeling☆16Updated 3 years ago
- Comprehensive database of ratings for 11k news domains☆21Updated last year
- A Python wrapper around the topic modeling functions of MALLET.☆99Updated 2 years ago
- A transformer-based language model trained on politics-related Twitter data. This repo is the official resource of the paper "PoliBERTwee…☆10Updated last year
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Generate network visualizations from Twitter data.☆19Updated last year
- ☆40Updated 6 months ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆65Updated 9 months ago
- The Open Jobs Observatory public mirror repo☆20Updated last year
- Pretrained BERT model for analysing COVID-19 Twitter data☆184Updated last year
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document …☆175Updated 7 months ago
- ✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources)☆87Updated 2 years ago