vatsan / text_analytics_on_mpp
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
☆17Updated 9 years ago
Alternatives and similar repositories for text_analytics_on_mpp
Users who are interested in text_analytics_on_mpp are comparing it to the repositories listed below
- Word2Vec models with Twitter data using Spark.☆65Updated 6 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 2 months ago
- Example scripts for various deep learning APIs.☆28Updated 10 years ago
- PDF and python files for creating time maps and downloading tweets☆59Updated 5 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 9 years ago
- Using Word2Vec on lists and sets☆34Updated 2 months ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Code for the DeepScript Submission to ICFHR2016 Competition on the Classification of Medieval Handwritings in Latin Script☆17Updated 8 years ago
- Healthcare Twitter Analysis☆26Updated 9 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 10 years ago
- This repo contains the exercises of the Next.ML 2015 presentation☆24Updated 10 years ago
- Deep learning for hackers: a hands-on approach to machine learning and deep learning.☆67Updated 10 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆37Updated 10 years ago
- ☆28Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- Visualization of text sentiment using deep learning☆43Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- A system for connecting language to space and time.☆64Updated 4 years ago
- Quick & dirty repo for hosting the Notebook for the t-SNE presentation delivered at Python Quants and PyData London meetups☆9Updated 9 years ago
- Data science repo to help others☆12Updated 9 years ago
- ☆46Updated 3 months ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 9 years ago
- Generalized Language Modeling toolkit☆51Updated 3 years ago
- Install directions and example notebooks for Udacity's Deep Learning classes☆28Updated 9 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 11 years ago