xsankar / fdps-v3
Code & Data for V3 of the Fast data Processing with Spark 2 book
☆15Updated 8 years ago
Alternatives and similar repositories for fdps-v3:
Users that are interested in fdps-v3 are comparing it to the libraries listed below
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Code for Packt Publishing's Spark for Data Science Cookbook.☆22Updated 7 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆29Updated 2 years ago
- Predictive analatics using deepLearning4j and Spark☆26Updated 8 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- Mastering Machine Learning with Spark 2.x, published by Packt☆43Updated 2 years ago
- Create scalable machine learning applications to power a modern data-driven business using Spark☆60Updated 2 years ago
- ☆26Updated last year
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 9 years ago
- Fast-Data-Processing-with-Spark-2☆22Updated 2 years ago
- ☆41Updated 8 years ago
- Code repository for Large Scale Machine Learning with Python, published by Packt☆90Updated 2 years ago
- Source code for 'Practical Hive' by Scott Shaw, Andreas François Vermeulen, Ankur Gupta, and David Kjerrumgaard☆34Updated 7 years ago
- ☆24Updated 8 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Companion code for my video course on Practical Python Data Science Techniques, published by Packt Publishing☆33Updated 7 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 2 years ago
- ☆28Updated 6 years ago
- Code files uploaded by Packt publishing☆33Updated 4 years ago
- Kaggle's click through rate prediction with Spark Pipeline API☆23Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- Some deep resources from apache spark, cloudera, my practice and so on. Most important is what i think.☆13Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Zeppelin notebook examples☆26Updated 9 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Updated 8 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 10 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago