Tokenization and pre-processing for Twitter data used to train classifiers.
☆72Sep 28, 2016Updated 9 years ago
Alternatives and similar repositories for tweetokenize
Users that are interested in tweetokenize are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Brief Introduction to Text Analysis Using R☆15Oct 27, 2016Updated 9 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆38Mar 27, 2015Updated 11 years ago
- Topical search for Twitter. See twokenize.py, emoticons.py for tokenization.☆161Sep 15, 2021Updated 4 years ago
- deep inverse regression☆31Nov 3, 2015Updated 10 years ago
- Presentation for the NYU Data Lab December 2015☆14Dec 2, 2015Updated 10 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Dataset and generative scripts for 3,200+ US Secretary of State visits (1905-present)☆20Dec 19, 2016Updated 9 years ago
- ☆13May 10, 2018Updated 8 years ago
- Statistics for each published edition of Data Is Plural.☆17Jun 1, 2021Updated 5 years ago
- Classify Twitter accounts as institutional or ordinary users.☆12Nov 16, 2018Updated 7 years ago
- Stability analysis for topic models☆52Oct 16, 2016Updated 9 years ago
- Code and data from our ACL 2014 paper "Humans Require Context to Infer Ironic Intent (so Computers Probably do, too)"☆16Jun 23, 2014Updated 12 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Dec 6, 2013Updated 12 years ago
- Classifier for predicting user interests based on Twitter profile and using Python library scikit-learn.☆31Jun 7, 2013Updated 13 years ago
- ☆18Feb 6, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10May 11, 2017Updated 9 years ago
- Scale ideological slant of Tweets☆21Jul 19, 2019Updated 6 years ago
- Deduplicate and parse list of `dirty names'☆22Nov 4, 2020Updated 5 years ago
- R Package to stream and analyze tweets using a mongodb☆13Mar 1, 2016Updated 10 years ago
- Text as Data Material for WashU Course☆15Nov 7, 2017Updated 8 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 11 years ago
- List of New York Times wedding announcements used in an Upshot story on name-changing.☆10Mar 7, 2019Updated 7 years ago
- Tools for scraping of twitter data, conversion, text analysis and graph construction☆11Aug 1, 2016Updated 9 years ago
- Materials for the WWW 2015 tutorial on online experiments for computational social science☆67Jul 7, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Oct 13, 2018Updated 7 years ago
- ☆14Feb 12, 2016Updated 10 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- MVC web framework in Python with Gevent, Jinja2, Werkzeug, SqlAlchemy, SASS☆45Apr 14, 2023Updated 3 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Sep 15, 2021Updated 4 years ago
- Lexical lemmatizer of italian text☆14Jun 12, 2017Updated 9 years ago
- smappdragon is a set of tools for working with twitter data.☆29Sep 1, 2018Updated 7 years ago
- Cyberbullying Detection System☆41Jun 17, 2015Updated 11 years ago
- Improve upon sentiment predictions for a Twitter dataset☆15Mar 7, 2016Updated 10 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Set up MIT's CLIFF geolocation service with Vagrant☆16May 5, 2015Updated 11 years ago
- Repository of detailed instructions for running online panel field experiments.☆36Aug 26, 2021Updated 4 years ago
- multilevel spatially-correlated variance components models☆18Jul 9, 2024Updated last year
- The Fallacy of Placing Confidence in Confidence Intervals☆38Oct 11, 2015Updated 10 years ago
- Code for DataViz course website☆19Feb 29, 2016Updated 10 years ago
- ☆10Mar 18, 2021Updated 5 years ago
- This is a completely open-source repo of interview questions and answers for people preparing for such interviews. This is maintained by …☆19May 18, 2026Updated last month