Tokenization and pre-processing for Twitter data used to train classifiers.
☆72Sep 28, 2016Updated 9 years ago
Alternatives and similar repositories for tweetokenize
Users that are interested in tweetokenize are comparing it to the libraries listed below.
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Mar 27, 2015Updated 11 years ago
- Topical search for Twitter. See twokenize.py, emoticons.py for tokenization.☆161Sep 15, 2021Updated 4 years ago
- deep inverse regression☆31Nov 3, 2015Updated 10 years ago
- Presentation for the NYU Data Lab December 2015☆14Dec 2, 2015Updated 10 years ago
- Dataset and generative scripts for 3,200+ US Secretary of State visits (1905-present)☆20Dec 19, 2016Updated 9 years ago
- PSCI 8357: Statistics for Political Research II☆11Apr 21, 2016Updated 9 years ago
- Python library for interacting with smapp collections☆19May 30, 2016Updated 9 years ago
- ☆13May 10, 2018Updated 7 years ago
- Computational Text Analysis Workshop Materials☆36May 6, 2016Updated 9 years ago
- Stability analysis for topic models☆52Oct 16, 2016Updated 9 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Dec 6, 2013Updated 12 years ago
- Slides and homework for model based inference☆13Sep 26, 2017Updated 8 years ago
- Classifier for predicting user interests based on Twitter profile and using Python library scikit-learn.☆31Jun 7, 2013Updated 12 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery☆12Oct 23, 2013Updated 12 years ago
- Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)☆21May 19, 2021Updated 4 years ago
- Automated svn2git mirror of include-what-you-use: link goes to upstream☆13May 27, 2015Updated 10 years ago
- Python module and command line script client for http://urbandictionary.com☆31Oct 20, 2019Updated 6 years ago
- Scale ideological slant of Tweets☆21Jul 19, 2019Updated 6 years ago
- Production and Consumption of APSR, BJPS, Perspectives, PS, and World Politics Articles☆14Jun 12, 2023Updated 2 years ago
- ☆17Apr 24, 2016Updated 9 years ago
- R Package to stream and analyze tweets using a mongodb☆13Mar 1, 2016Updated 10 years ago
- Information about and materials for graduate course "Logic of Quantitative Research in Political Science" at the University of Copenhagen…☆17Feb 14, 2017Updated 9 years ago
- Text as Data Material for WashU Course☆15Nov 7, 2017Updated 8 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- Python natural language processing work☆29Sep 14, 2009Updated 16 years ago
- Benchmarking different LSTM libraries☆25Mar 22, 2016Updated 10 years ago
- List of New York Times wedding announcements used in an Upshot story on name-changing.☆10Mar 7, 2019Updated 7 years ago
- My MSc project☆14Jun 5, 2011Updated 14 years ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.☆51Oct 23, 2014Updated 11 years ago
- This project contains the necessary files to reproduce the paper: "Explaining Character-Aware Neural Networks for Word-Level Prediction: …☆12Nov 15, 2018Updated 7 years ago
- Tools for scraping of twitter data, conversion, text analysis and graph construction☆11Aug 1, 2016Updated 9 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Oct 13, 2018Updated 7 years ago
- ☆14Feb 12, 2016Updated 10 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Sep 15, 2021Updated 4 years ago
- smappdragon is a set of tools for working with twitter data.☆29Sep 1, 2018Updated 7 years ago
- Classification of incivility in Reddit posts☆18Nov 19, 2020Updated 5 years ago
- Improve upon sentiment predictions for a Twitter dataset☆15Mar 7, 2016Updated 10 years ago
- My ongoing portfolio showcasing Data Science and Programming Projects☆11Aug 9, 2022Updated 3 years ago