vmware-archive / gp-r
A place for all things Pivotal & R
☆25Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for gp-r
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP …☆17Updated 8 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Updated 6 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 9 years ago
- open source version of the Bonsai library☆26Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 3 years ago
- An example project for doing grid search in MLlib☆13Updated 9 years ago
- from zero to storm cluster for realtime classification using sklearn☆12Updated 10 years ago
- Mirror of Apache Zeppelin (Incubating)☆45Updated 8 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 6 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Updated 5 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 9 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 5 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆43Updated 8 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- ☆20Updated 7 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- R interface to Tensorflow via skflow☆18Updated 8 years ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 8 years ago
- unsupervised multi-model fraud-detection algorithm √☆11Updated 9 years ago
- A Topic Modeling toolbox☆93Updated 8 years ago
- Datasets and notebooks☆13Updated 8 years ago
- Deploy Dask on Marathon☆10Updated 7 years ago
- Sparklyr Extensions API☆31Updated 8 years ago
- An convenient R tool for manipulating tables in PostgreSQL type databases and a wrapper of Apache MADlib.☆125Updated 2 years ago