Cleans Reddit Text Data
☆83Apr 14, 2020Updated 6 years ago
Alternatives and similar repositories for redditcleaner
Users that are interested in redditcleaner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pretrained models for the ranking task described in Cats and Captions vs. Creators and the Clock (WWW 2017)☆11Apr 28, 2019Updated 7 years ago
- Geolocation Inference for Reddit☆14Jun 17, 2024Updated 2 years ago
- An efficient implementation of a set which can be randomly sampled according to the weights of the elements.☆24May 13, 2024Updated 2 years ago
- ☆12Aug 2, 2024Updated last year
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Jun 9, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CSS workshop on word embeddings for the social sciences, 3/19/21☆12Mar 19, 2021Updated 5 years ago
- Learning to Distinguish Hypernyms and Co-Hyponyms☆18Nov 11, 2014Updated 11 years ago
- All purpose paper template with a base arxiv/revtex4 format and sample journal options☆23Apr 12, 2026Updated 2 months ago
- Read compressed NDJSON .zst files easily☆36May 13, 2026Updated last month
- Replication Materials for "Crowd-Sourced Text Analysis" APSR (2016) 110(2): 278-295.☆11Oct 28, 2017Updated 8 years ago
- A temporal ordering system for events and time expressions in written text.☆43Feb 26, 2022Updated 4 years ago
- Reproducible Retrieval of Pew Research Center Datasets in R☆10Apr 14, 2021Updated 5 years ago
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆13Mar 5, 2023Updated 3 years ago
- OSoMe Twitter tools. Including a package like tweepy but for the v2 Twitter api.☆31Jan 6, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The News Landscape Toolkit (NELA)☆16Oct 14, 2020Updated 5 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Oct 13, 2018Updated 7 years ago
- hacky way to bundle some edges in networkx and matplotlib☆20May 26, 2020Updated 6 years ago
- The COVID-19 Digital Observatory collects, aggregates, and distributes data from social media, search engine results, and Wikipedia to su…☆11Dec 17, 2020Updated 5 years ago
- Supplementary and replication materials for paper "Examining a Most Likely Case for Strong Campaign Effects: Hitler's Speeches and the Ri…☆15Jun 6, 2018Updated 8 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Nov 2, 2019Updated 6 years ago
- ☆12Jun 5, 2025Updated last year
- The list of Ukrainian words for sentiment analysis and NLP☆15Sep 5, 2021Updated 4 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings"☆19Jun 5, 2019Updated 7 years ago
- Code and Data for paper: Cross-Partisan Discussions on YouTube: Conservatives Talk to Liberals but Liberals Don't Talk to Conservatives (…☆13Jun 16, 2021Updated 5 years ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆76Oct 5, 2022Updated 3 years ago
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆288Mar 23, 2026Updated 2 months ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- Corpus-based Set Expansion with Lexical Features and Distributed Representations (SIGIR '19)☆13Jul 18, 2019Updated 6 years ago
- {rtweet} helpers for automating large or time-consuming downloads☆23Dec 11, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Count the number of matches for a regex string in a subreddit☆11May 29, 2020Updated 6 years ago
- ☆11Feb 8, 2022Updated 4 years ago
- Model for provider-neutral financial data, with implementation for IEX☆14Jul 30, 2019Updated 6 years ago
- The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering)☆14May 13, 2026Updated last month
- This repository is to download the SEMCATdataset 2018 for the publication "Senel L. K., Utlu I., Yucesoy V., Koc A., Cukur T., Semantic S…☆10Sep 18, 2020Updated 5 years ago
- [ICASSP'23] This repo contains code for the Demux & MEmo emotion recognition models (https://arxiv.org/abs/2210.15842), as well as code t…☆23Jan 18, 2024Updated 2 years ago
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling☆15Nov 15, 2020Updated 5 years ago