gautamsm / data-science-on-mpp
A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases
☆20 Updated 9 years ago
Alternatives and similar repositories for data-science-on-mpp
Users who are interested in data-science-on-mpp are comparing it to the libraries listed below.
- feng - feature engineering for machine-learning champions ☆27 Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML ☆70 Updated 5 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆52 Updated 6 years ago
- A simple introduction to using spark ml pipelines ☆26 Updated 7 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 7 years ago
- An example to illustrate using Luigi to manage a data science workflow in Greenplum Database ☆12 Updated 6 years ago
- A place for all things Pivotal & R ☆25 Updated 3 years ago
- A couple projects using scikit-learn illustrating project decision making. ☆15 Updated 8 years ago
- A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms ☆63 Updated 4 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io ☆11 Updated 7 years ago
- Simple validator for submissions to DrivenData competitions ☆19 Updated 6 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 Updated 9 years ago
- Dask tutorial for PyData DC 2016 ☆11 Updated 8 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR. ☆44 Updated 8 years ago
- Simplified tree-based classifier and regressor for interpretable machine learning (scikit-learn compatible) ☆47 Updated 4 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- Docker container with a PyData stack and JupyterHub server ☆37 Updated 9 years ago
- Apache Toree quickstart tutorial ☆29 Updated 9 years ago
- Repo for experiments on pyspark and sklearn ☆79 Updated 11 years ago
- A cookiecutter template for Apache Spark applications written in Scala ☆10 Updated 6 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 Updated 9 years ago
- Using Pandas easily with Cassandra ☆23 Updated 7 years ago
- Proposals for new Jupyter subprojects to enter into incubation ☆18 Updated 4 years ago
- Common post-estimation tasks for scikit-learn ☆17 Updated 8 years ago
- Collection of dask example notebooks ☆58 Updated 7 years ago
- REST web service for scoring PMML models ☆50 Updated 11 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Machine learning evaluation database ☆24 Updated 7 years ago
- Fast, easy and intuitive machine learning prototyping. ☆124 Updated 11 years ago