Tokenization and pre-processing for Twitter data used to train classifiers.
☆72 · Sep 28, 2016 · Updated 9 years ago
Alternatives and similar repositories for tweetokenize
Users that are interested in tweetokenize are comparing it to the libraries listed below.
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015. ☆37 · Mar 27, 2015 · Updated 11 years ago
- Topical search for Twitter. See twokenize.py, emoticons.py for tokenization. ☆161 · Sep 15, 2021 · Updated 4 years ago
- deep inverse regression ☆31 · Nov 3, 2015 · Updated 10 years ago
- Presentation for the NYU Data Lab December 2015 ☆14 · Dec 2, 2015 · Updated 10 years ago
- Dataset and generative scripts for 3,200+ US Secretary of State visits (1905-present) ☆20 · Dec 19, 2016 · Updated 9 years ago
- PSCI 8357: Statistics for Political Research II ☆11 · Apr 21, 2016 · Updated 10 years ago
- Python library for interacting with smapp collections ☆19 · May 30, 2016 · Updated 10 years ago
- ☆13 · May 10, 2018 · Updated 8 years ago
- Statistics for each published edition of Data Is Plural. ☆17 · Jun 1, 2021 · Updated 5 years ago
- Code of NAACL paper "Unsupervised Multi-Domain Adaptation with Feature Embeddings" ☆33 · May 7, 2015 · Updated 11 years ago
- Stability analysis for topic models ☆52 · Oct 16, 2016 · Updated 9 years ago
- Code and data from our ACL 2014 paper "Humans Require Context to Infer Ironic Intent (so Computers Probably do, too)" ☆16 · Jun 23, 2014 · Updated 11 years ago
- Slides and homework for model based inference ☆13 · Sep 26, 2017 · Updated 8 years ago
- ☆18 · Feb 6, 2016 · Updated 10 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery ☆12 · Oct 23, 2013 · Updated 12 years ago
- Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/) ☆21 · May 19, 2021 · Updated 5 years ago
- Scale ideological slant of Tweets ☆21 · Jul 19, 2019 · Updated 6 years ago
- Production and Consumption of APSR, BJPS, Perspectives, PS, and World Politics Articles ☆14 · Jun 12, 2023 · Updated 3 years ago
- Deduplicate and parse list of 'dirty names' ☆22 · Nov 4, 2020 · Updated 5 years ago
- R Package to stream and analyze tweets using MongoDB ☆13 · Mar 1, 2016 · Updated 10 years ago
- Information about and materials for graduate course "Logic of Quantitative Research in Political Science" at the University of Copenhagen… ☆17 · Feb 14, 2017 · Updated 9 years ago
- Cross-domain word representation learning ☆10 · May 23, 2015 · Updated 11 years ago
- WordNet to neo4j 2.2 ☆12 · Nov 6, 2015 · Updated 10 years ago
- My MSc project ☆14 · Jun 5, 2011 · Updated 15 years ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python. ☆51 · Oct 23, 2014 · Updated 11 years ago
- This project contains the necessary files to reproduce the paper: "Explaining Character-Aware Neural Networks for Word-Level Prediction: … ☆12 · Nov 15, 2018 · Updated 7 years ago
- Tools for scraping of Twitter data, conversion, text analysis and graph construction ☆11 · Aug 1, 2016 · Updated 9 years ago
- Materials for the WWW 2015 tutorial on online experiments for computational social science ☆67 · Jul 7, 2015 · Updated 10 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 · Oct 13, 2018 · Updated 7 years ago
- ☆14 · Feb 12, 2016 · Updated 10 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial ☆30 · Sep 30, 2015 · Updated 10 years ago
- MVC web framework in Python with Gevent, Jinja2, Werkzeug, SqlAlchemy, SASS ☆45 · Apr 14, 2023 · Updated 3 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison. ☆14 · Sep 15, 2021 · Updated 4 years ago
- This tool scrapes status.pr every hour and keeps track of changing metrics in order to help visualize and measure progress. ☆14 · Dec 8, 2022 · Updated 3 years ago
- Lexical lemmatizer of Italian text ☆14 · Jun 12, 2017 · Updated 9 years ago
- Cyberbullying Detection System ☆41 · Jun 17, 2015 · Updated 10 years ago
- My ongoing portfolio showcasing Data Science and Programming Projects ☆11 · Aug 9, 2022 · Updated 3 years ago
- Improve upon sentiment predictions for a Twitter dataset ☆15 · Mar 7, 2016 · Updated 10 years ago
- R Package for Automated Speech Recognition ☆10 · Aug 10, 2015 · Updated 10 years ago