vmware-archive / gp-r
A place for all things Pivotal & R
☆25 Updated 2 years ago
Alternatives and similar repositories for gp-r
Users who are interested in gp-r are comparing it to the repositories listed below:
- Utilities and examples to assist in working with PySpark and Cassandra. ☆36 Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 Updated 10 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix). ☆15 Updated 4 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- Apache Zeppelin on Kubernetes. ☆28 Updated 5 years ago
- Repo for experiments on pyspark and sklearn ☆79 Updated 10 years ago
- SmallK: very fast data clustering tools ☆14 Updated 5 years ago
- Deep neural networks on over 50 classification problems from the UC Irvine Machine Learning Repository ☆25 Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features ☆41 Updated 8 years ago
- R interface to Tensorflow via skflow ☆18 Updated 8 years ago
- Apache Toree quickstart tutorial ☆29 Updated 8 years ago
- ☆20 Updated 7 years ago
- Datasets and notebooks ☆13 Updated 8 years ago
- A convenient R tool for manipulating tables in PostgreSQL-type databases and a wrapper for Apache MADlib. ☆126 Updated 2 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 Updated 7 years ago
- ☆28 Updated 8 years ago
- Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP … ☆17 Updated 8 years ago
- A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms ☆63 Updated 4 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 6 years ago
- from zero to storm cluster for realtime classification using sklearn ☆12 Updated 10 years ago
- self organizing map and variations implemented in Spark ☆9 Updated 8 years ago
- training material ☆47 Updated 2 months ago
- Deploy Dask on Marathon ☆10 Updated 7 years ago
- Docker images for data science from Wise.io ☆50 Updated 8 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models. ☆61 Updated 5 years ago
- R dplyr connector for ImpalaDB ☆15 Updated 7 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016 ☆26 Updated 8 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015) ☆33 Updated 5 years ago