vatsan / text_analytics_on_mpp
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
☆17Updated 9 years ago
Alternatives and similar repositories for text_analytics_on_mpp:
Users that are interested in text_analytics_on_mpp are comparing it to the libraries listed below
- Deploy sentiment analysis using Flask☆17Updated 5 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum☆17Updated 10 years ago
- PDF and python files for creating time maps and downloading tweets☆59Updated 4 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Topic and sentiment analysis of tweets (demo)☆11Updated 6 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Updated 9 years ago
- Pattern-of-Behavior Search Tool☆11Updated 2 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 8 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- A book on the applications of topic models.☆14Updated 7 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 9 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Python client for ScienceOps☆29Updated 5 years ago
- ☆20Updated 8 years ago
- Generalized Language Modeling toolkit☆51Updated 2 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- ☆41Updated 4 years ago
- Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).☆16Updated 10 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- Healthcare Twitter Analysis☆26Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 5 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- Distributed Streaming Quantiles (for PySpark)☆38Updated 11 years ago