vatsan / text_analytics_on_mppLinks
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
☆17Updated 9 years ago
Alternatives and similar repositories for text_analytics_on_mpp
Users that are interested in text_analytics_on_mpp are comparing it to the libraries listed below
Sorting:
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Deploy sentiment analysis using Flask☆17Updated 5 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Using Word2Vec on lists and sets☆34Updated 3 weeks ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- ☆11Updated 8 years ago
- PDF and python files for creating time maps and downloading tweets☆60Updated 4 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum☆17Updated 10 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆37Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 9 years ago
- PyData Madrid 2016 material for the talk: A Primer to recommendation Systems☆37Updated 9 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 10 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 5 years ago
- Topic and sentiment analysis of tweets (demo)☆11Updated 6 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 3 weeks ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Updated 9 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- Predicting sales with Pandas☆15Updated 9 years ago