gautamsm / data-science-on-mppLinks
A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases
☆20Updated 9 years ago
Alternatives and similar repositories for data-science-on-mpp
Users that are interested in data-science-on-mpp are comparing it to the libraries listed below
Sorting:
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Deploy sentiment analysis using Flask☆17Updated 5 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- A place for all things Pivotal & R☆25Updated 3 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆70Updated 5 years ago
- Simple validator for submissions to DrivenData competitions☆19Updated 5 years ago
- Spark Parameter Optimization and Tuning☆31Updated 7 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 9 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 10 years ago
- Notes on Lambda Architecture☆12Updated 7 years ago
- My talk at Strata 2014 in Santa Clara, CA☆73Updated 11 years ago
- Python client for ScienceOps☆29Updated 5 years ago
- Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP …☆17Updated 9 years ago
- ☆41Updated 7 years ago
- Data science repo to help others☆12Updated 9 years ago
- Machine Learning Versioning made Simple☆38Updated 2 years ago
- Simplified tree-based classifier and regressor for interpretable machine learning (scikit-learn compatible)☆47Updated 4 years ago
- Exploration Library in Java☆12Updated last year
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 10 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra☆29Updated 7 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆65Updated 10 years ago
- ☆36Updated 9 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 10 years ago