xsankar / fdps-v3
Code & Data for V3 of the Fast data Processing with Spark 2 book
☆15Updated 8 years ago
Alternatives and similar repositories for fdps-v3:
Users that are interested in fdps-v3 are comparing it to the libraries listed below
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆29Updated 2 years ago
- Fast-Data-Processing-with-Spark-2☆22Updated 2 years ago
- Create scalable machine learning applications to power a modern data-driven business using Spark☆60Updated 2 years ago
- Code files uploaded by Packt publishing☆33Updated 4 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- Updated repository☆157Updated 3 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Updated 8 years ago
- Spark 2.0 Python Machine Learning examples☆97Updated 5 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆83Updated 5 years ago
- Python library for converting Apache Spark ML pipelines to PMML☆95Updated last year
- PySpark Machine Learning Examples☆44Updated 6 years ago
- ☆44Updated 7 years ago
- Code for Packt Publishing's Spark for Data Science Cookbook.☆22Updated 7 years ago
- Spark SQL UDF examples☆56Updated 7 years ago
- Source code for 'Practical Hive' by Scott Shaw, Andreas François Vermeulen, Ankur Gupta, and David Kjerrumgaard☆34Updated 7 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Machine Learning with Spark - Second Edition, by Packt☆115Updated 4 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabi☆10Updated 7 years ago
- Mastering Machine Learning with Spark 2.x, published by Packt☆43Updated 2 years ago
- Code repository for Large Scale Machine Learning with Python, published by Packt☆90Updated 2 years ago
- JPMML-SparkML plugin for converting LightGBM-Spark models to PMML☆41Updated 3 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 2 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- Learn the pyspark API through pictures and simple examples☆169Updated 4 years ago
- A collection of Hive UDFs☆75Updated 4 years ago
- ☆53Updated 2 years ago
- ☆41Updated 8 years ago
- Film recommendations with Apache Spark and Python☆61Updated 9 years ago