pushshift / Reddit-Bot-Detector
Script to extract highly probable bots for further analysis
☆12Updated 7 years ago
Alternatives and similar repositories for Reddit-Bot-Detector:
Users that are interested in Reddit-Bot-Detector are comparing it to the libraries listed below
- Predict age and gender from a first name☆60Updated 6 years ago
- DreamBank Visualized - An interactive visualization of over 26,000 dream transcriptions☆15Updated 6 years ago
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- Turning news into events since 2014.☆50Updated 7 years ago
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 6 years ago
- extract relationships from standardized terms from corpus of interest with deep learning☆20Updated 5 years ago
- A high performance indexing and search system for managing big data☆17Updated 5 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 3 months ago
- Vector Space Model Framework developed for InPhO☆36Updated 5 years ago
- Twitter user classification tutorial at PyCon France 2016☆21Updated last year
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Stylometric framework in Python☆13Updated 9 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private…☆23Updated 3 years ago
- ☆31Updated 9 years ago
- The documentation and scripts for the Local News Dataset☆23Updated 2 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- Classifying the content of domains☆56Updated last year
- Spell correct entire sentences using nltk freqdist and symspell☆19Updated 7 years ago
- [development moved to termite-data-server]☆61Updated 10 years ago
- Extract all the fields from the NY Times Corpus to a csv☆26Updated 2 years ago
- smappdragon is a set of tools for working with twitter data.☆29Updated 6 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆52Updated 3 years ago
- OSoMe API mashups☆11Updated 5 years ago
- topic model visualization☆52Updated 9 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated last year
- Topic Model or LDA in Cython☆21Updated 13 years ago