vatsan / text_analytics_on_mpp
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
☆17 · Updated 8 years ago
Related projects:
- Library for Geo-Inferencing in Twitter Data ☆28 · Updated 8 years ago
- Code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 · Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 · Updated 8 years ago
- System for mining Wikipedia Usage data to read our collective mind ☆21 · Updated 9 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation ☆43 · Updated 8 years ago
- PDF and python files for creating time maps and downloading tweets ☆58 · Updated 4 years ago
- Using Word2Vec on lists and sets ☆34 · Updated 8 years ago
- Repo for experiments on pyspark and sklearn ☆79 · Updated 10 years ago
- Some IPython notebooks I've created... ☆29 · Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆21 · Updated 8 years ago
- Machine learning evaluation database ☆24 · Updated 6 years ago
- Visualization of text sentiment using deep learning ☆44 · Updated 8 years ago
- A simple example of containerized data science with python and Docker. ☆51 · Updated 6 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 · Updated 7 years ago
- A place for all things Pivotal & R ☆25 · Updated 2 years ago
- Healthcare Twitter Analysis ☆26 · Updated 8 years ago
- A Python wrapper over the GraphGen system ☆37 · Updated 7 years ago
- word2vec workshop - a conceptual introduction and practical application ☆22 · Updated 8 years ago
- Scripts to Analyze Pronto's Data Release ☆25 · Updated 8 years ago
- Predicting sales with Pandas ☆15 · Updated 8 years ago
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark ☆14 · Updated 8 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015) ☆34 · Updated 5 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum ☆17 · Updated 10 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆66 · Updated 5 years ago
- Simple validator for submissions to DrivenData competitions ☆19 · Updated 5 years ago
- Example scripts for various deep learning APIs. ☆28 · Updated 9 years ago