Cleans Reddit Text Data
☆83Apr 14, 2020Updated 5 years ago
Alternatives and similar repositories for redditcleaner
Users that are interested in redditcleaner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient implementation of a set which can be randomly sampled according to the weights of the elements.☆24May 13, 2024Updated last year
- ☆12Aug 2, 2024Updated last year
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Jun 9, 2019Updated 6 years ago
- Learning to Distinguish Hypernyms and Co-Hyponyms☆18Nov 11, 2014Updated 11 years ago
- All purpose paper template with a base arxiv/revtex4 format and sample journal options☆23Dec 19, 2025Updated 3 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Read compressed NDJSON .zst files easily☆35Jul 5, 2022Updated 3 years ago
- A temporal ordering system for events and time expressions in written text.☆42Feb 26, 2022Updated 4 years ago
- An engine for fast time series data aggregation☆13Jan 8, 2026Updated 2 months ago
- Reproducible Retrieval of Pew Research Center Datasets in R☆10Apr 14, 2021Updated 4 years ago
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆14Mar 5, 2023Updated 3 years ago
- A review of APIs.☆69Sep 17, 2024Updated last year
- Tracking significant changes to the Twitter API or platform as a whole☆20May 16, 2022Updated 3 years ago
- The News Landscape Toolkit (NELA)☆16Oct 14, 2020Updated 5 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Oct 13, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- hacky way to bundle some edges in networkx and matplotlib☆20May 26, 2020Updated 5 years ago
- 🍉 Oh, my resume!☆12Apr 12, 2023Updated 2 years ago
- Supplementary and replication materials for paper "Examining a Most Likely Case for Strong Campaign Effects: Hitler's Speeches and the Ri…☆14Jun 6, 2018Updated 7 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Nov 2, 2019Updated 6 years ago
- Notebooks for JHU EN 601.320/420/620☆10May 1, 2019Updated 6 years ago
- This repository contains data of TruthSocial posts related to the 2024 U.S. Elections☆12Nov 1, 2024Updated last year
- quanteda textmodel extensions for classifying documents☆21Oct 17, 2023Updated 2 years ago
- Experiments for recognising textual entailment☆14Oct 12, 2012Updated 13 years ago
- An API for retrieving locally-relevant structured data about US elections☆30Nov 10, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The list of Ukrainian words for sentiment analysis and NLP☆15Sep 5, 2021Updated 4 years ago
- A distributed test automation framework with a centralized management web UI☆20Oct 3, 2023Updated 2 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings"☆19Jun 5, 2019Updated 6 years ago
- Code and Data for paper: Cross-Partisan Discussions on YouTube: Conservatives Talk to Liberals but Liberals Don't Talk to Conservatives (…☆13Jun 16, 2021Updated 4 years ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆76Oct 5, 2022Updated 3 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆287Mar 17, 2026Updated last week
- Twitter stream and social network crawling tools☆17Nov 17, 2016Updated 9 years ago
- Corpus-based Set Expansion with Lexical Features and Distributed Representations (SIGIR '19)☆13Jul 18, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Using Python to extract the financial data from XBRL instance documents.☆11May 2, 2021Updated 4 years ago
- Paper and related materials for Rodriguez & Spirling (JOP, 2022) word embeddings overview and assessment☆49Feb 14, 2022Updated 4 years ago
- The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering)☆14Jan 27, 2023Updated 3 years ago
- A high performance indexing and search system for managing big data☆18Mar 18, 2019Updated 7 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago
- [ICASSP'23] This repo contains code for the Demux & MEmo emotion recognition models (https://arxiv.org/abs/2210.15842), as well as code t…☆23Jan 18, 2024Updated 2 years ago
- Dataset to train NLP model predicting YouTube title based on video content☆19May 10, 2025Updated 10 months ago