vmware-archive / gp-r
A place for all things Pivotal & R
☆25Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for gp-r
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 9 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 5 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- open source version of the Bonsai library☆26Updated 8 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP …☆17Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Mirror of Apache Spark (With R Frontend on Spark Streaming)☆11Updated 9 years ago
- Deep neural networks on over 50 classification problems from the UC Irvine Machine Learning Repository☆25Updated 9 years ago
- Deep learning certificate part 1☆10Updated 2 years ago
- Data science repo to help others☆12Updated 8 years ago
- Apache Toree quickstart tutorial☆29Updated 8 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 9 years ago
- unsupervised multi-model fraud-detection algorithm √☆11Updated 9 years ago
- An R package to streamline the training, fine-tuning and predicting processes for deep learning based on 'darch' and 'deepnet'.☆45Updated 9 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆43Updated 8 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 6 years ago
- from zero to storm cluster for realtime classification using sklearn☆12Updated 10 years ago
- Distributed Matrix Library☆70Updated 7 years ago
- Scalable R for Machine Learning☆42Updated 6 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 6 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 9 years ago