rspeer / text-as-dataLinks
A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.
☆51Updated 11 years ago
Alternatives and similar repositories for text-as-data
Users that are interested in text-as-data are comparing it to the libraries listed below
Sorting:
- A Topic Modeling toolbox☆92Updated 9 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 10 years ago
- Topic modeling web application☆40Updated 10 years ago
- rapid nlp prototyping☆71Updated 3 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- Exploring Text, Graphically☆12Updated 10 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 10 years ago
- Turbo topics find significant multiword phrases in topics.☆46Updated 10 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.)☆29Updated 14 years ago
- Visualization of text sentiment using deep learning☆43Updated 9 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Demo code for learning_text_transformer☆25Updated 10 years ago
- ☆81Updated 9 years ago
- Stability analysis for topic models☆51Updated 9 years ago
- Flask app to run a bandit algorithm testing different beer recommenders☆25Updated 11 years ago
- Standalone Semanticizer☆32Updated 10 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 9 years ago
- Scripts and modules used for creating document clusters from word2vec☆40Updated 8 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 10 years ago
- This repo contain the exercies of the Next.ML 2015 presentation☆24Updated 10 years ago
- Latent Dirichlet Allocation with Gibbs sampling☆16Updated 12 years ago
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 10 years ago
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype☆32Updated 10 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆207Updated 3 years ago
- A hack to replace Pride & Prejudice text with closest word2vec model word, and visualize results.☆61Updated 11 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆58Updated 7 years ago