Cleans Reddit Text Data
☆83Apr 14, 2020Updated 6 years ago
Alternatives and similar repositories for redditcleaner
Users that are interested in redditcleaner are comparing it to the libraries listed below.
- Pretrained models for the ranking task described in Cats and Captions vs. Creators and the Clock (WWW 2017)☆11Apr 28, 2019Updated 7 years ago
- Geolocation Inference for Reddit☆14Jun 17, 2024Updated last year
- ☆12Aug 2, 2024Updated last year
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Jun 9, 2019Updated 6 years ago
- Learning to Distinguish Hypernyms and Co-Hyponyms☆18Nov 11, 2014Updated 11 years ago
- CSS workshop on word embeddings for the social sciences, 3/19/21☆12Mar 19, 2021Updated 5 years ago
- Read compressed NDJSON .zst files easily☆36Jul 5, 2022Updated 3 years ago
- A temporal ordering system for events and time expressions in written text.☆42Feb 26, 2022Updated 4 years ago
- Reproducible Retrieval of Pew Research Center Datasets in R☆10Apr 14, 2021Updated 5 years ago
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆14Mar 5, 2023Updated 3 years ago
- A review of APIs.☆69Sep 17, 2024Updated last year
- The News Landscape Toolkit (NELA)☆16Oct 14, 2020Updated 5 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Oct 13, 2018Updated 7 years ago
- 🍉 Oh, my resume!☆12Apr 12, 2023Updated 3 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Nov 2, 2019Updated 6 years ago
- ☆12Jun 5, 2025Updated 11 months ago
- Notebooks for JHU EN 601.320/420/620☆10May 1, 2019Updated 7 years ago
- The list of Ukrainian words for sentiment analysis and NLP☆15Sep 5, 2021Updated 4 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- Code and Data for paper: Cross-Partisan Discussions on YouTube: Conservatives Talk to Liberals but Liberals Don't Talk to Conservatives (…☆13Jun 16, 2021Updated 4 years ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆76Oct 5, 2022Updated 3 years ago
- mapped with CPT and ICD codes☆16Dec 7, 2015Updated 10 years ago
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- Twitter stream and social network crawling tools☆17Nov 17, 2016Updated 9 years ago
- {rtweet} helpers for automating large or time-consuming downloads☆23Dec 11, 2019Updated 6 years ago
- ☆11Feb 8, 2022Updated 4 years ago
- This repository is to download the SEMCATdataset 2018 for the publication "Senel L. K., Utlu I., Yucesoy V., Koc A., Cukur T., Semantic S…☆10Sep 18, 2020Updated 5 years ago
- A high performance indexing and search system for managing big data☆18Mar 18, 2019Updated 7 years ago
- ☆15Apr 12, 2023Updated 3 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago
- Dataset to train NLP model predicting YouTube title based on video content☆20May 10, 2025Updated 11 months ago
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling☆15Nov 15, 2020Updated 5 years ago
- Sparse Additive Generative Model of Text☆90Sep 2, 2016Updated 9 years ago
- Dataset containing Semantic Relations and Metadata, for Training and Evaluating Distributional Semantic Models in English and Mandarin Ch…☆16Aug 7, 2017Updated 8 years ago
- Implements the Adaptive Fuzzy String Matching model from Kaufman & Klevs☆11Nov 28, 2022Updated 3 years ago
- KSP plugin showing a temperature gauge and which part is going to blow up first☆10Apr 30, 2022Updated 4 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- nytimes: Interacting with New York Times APIs☆27Aug 4, 2018Updated 7 years ago
- Scrape comments, including their replies, from a YouTube video.☆39Jan 31, 2021Updated 5 years ago