gautamsm / data-science-on-mpp
A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases
☆20Updated 8 years ago
Alternatives and similar repositories for data-science-on-mpp:
Users that are interested in data-science-on-mpp are comparing it to the libraries listed below
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 10 years ago
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark☆14Updated 8 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆70Updated 5 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Data science repo to help others☆12Updated 9 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io☆11Updated 7 years ago
- Using Pandas easily with Cassandra☆23Updated 7 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- HopsYARN Tensorflow Framework.☆31Updated 5 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines☆13Updated 9 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- My talk at Strata 2014 in Santa Clara, CA☆73Updated 11 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 9 years ago
- REST web service for scoring PMML models☆50Updated 11 years ago
- Seldon UCL Project☆17Updated 8 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- Spark Parameter Optimization and Tuning☆31Updated 6 years ago
- Example Tensorflow Processor using Java API for Apache NiFi 1.2 - 1.9.1+☆39Updated 5 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- ☆26Updated last year
- Some wrappers around python modules for simplifying the data exploration process.☆13Updated 3 months ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Apache Sqoop Cookbook☆36Updated 11 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆64Updated 10 years ago