vatsan / text_analytics_on_mpp
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
☆17Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for text_analytics_on_mpp
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- ☆11Updated 8 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- A book on the applications of topic models.☆14Updated 7 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- Deploy sentiment analysis using Flask☆17Updated 5 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆43Updated 8 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- PDF and python files for creating time maps and downloading tweets☆58Updated 4 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 8 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 9 years ago
- ☆38Updated 8 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 7 years ago
- ☆41Updated 4 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Data science repo to help others☆12Updated 8 years ago
- A tool that evolves small brains capable of scanning and classifying an image.☆13Updated 8 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 10 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum☆17Updated 10 years ago
- ☆14Updated 9 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 10 years ago
- Scikit-learn quickstart tutorial for Webstep☆18Updated 7 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 7 years ago