zekeriyyaa / Apache-Spark-Structured-Streaming-Via-Docker-ComposeLinks
☆13Updated 2 years ago
Alternatives and similar repositories for Apache-Spark-Structured-Streaming-Via-Docker-Compose
Users that are interested in Apache-Spark-Structured-Streaming-Via-Docker-Compose are comparing it to the libraries listed below
Sorting:
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster☆42Updated 3 years ago
- Data Engineering with Spark and Delta Lake☆106Updated 2 years ago
- Python library for automating administration and data science in Strategy One environments☆97Updated last month
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- ☆88Updated 3 years ago
- Examples surrounding Databricks.☆60Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 5 years ago
- Airflow training for the crunch conf☆104Updated 7 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆135Updated 3 years ago
- spark on kubernetes☆104Updated 2 years ago
- Delta Lake Documentation☆51Updated last year
- EverythingApacheNiFi☆116Updated 2 years ago
- ☆58Updated 11 months ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Updated 4 years ago
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab…☆40Updated 5 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 3 years ago
- ☆152Updated 7 years ago
- Course Material☆25Updated 2 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated last year
- Spark data pipeline that processes movie ratings data.☆31Updated last week
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆60Updated 7 years ago
- Snowflake Cookbook, published by Packt☆82Updated 2 years ago
- Delta Lake examples☆236Updated last year
- ☆94Updated 2 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 7 months ago
- PySpark Cheatsheet☆36Updated 2 years ago
- Airflow helm chart for AWS EKS☆20Updated 4 years ago
- Sample Airflow DAGs☆64Updated 3 years ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆36Updated last year