newfront / spark-intro-to-ml
A Gentle Introduction to Machine Learning with Apache Spark
☆11Updated 2 years ago
Alternatives and similar repositories for spark-intro-to-ml
Users that are interested in spark-intro-to-ml are comparing it to the libraries listed below
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆40Updated last year
- Spark and Hive docker containers sharing a common MySQL metastore☆26Updated 5 years ago
- Magic to help Spark pipelines upgrade☆35Updated 8 months ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 4 months ago
- Basic getting started with Kafka examples☆47Updated 6 years ago
- A collection of examples showing different ways to manage state in Apache Flink☆27Updated 6 years ago
- Flowchart for debugging Spark applications☆105Updated 8 months ago
- Code snippets used in demos recorded for the blog.☆37Updated last month
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆68Updated last year
- A tool to validate data, built around Apache Spark.☆101Updated 3 weeks ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆75Updated 2 years ago
- Sample Spark Code☆91Updated 6 years ago
- Learn the Confluent Schema Registry & REST Proxy☆191Updated last year
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆20Updated 3 years ago
- AWS Big Data Certification☆25Updated 4 months ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Updated last year
- Examples of using the DataStax Apache Kafka Connector.☆46Updated last year
- An example Apache Beam project.☆111Updated 8 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu…☆75Updated 6 years ago
- A pyspark lib to validate data quality☆18Updated 2 years ago
- Sample processing code using Spark 2.1+ and Scala☆52Updated 4 years ago
- Spark DataFrame transformation and UDF test examples☆23Updated 2 years ago