Anant / example-airflow-and-spark
☆12 Updated 3 years ago
Alternatives and similar repositories for example-airflow-and-spark
Users who are interested in example-airflow-and-spark are comparing it to the repositories listed below
- Docker with Airflow and Spark standalone cluster ☆261 Updated 2 years ago
- ☆88 Updated 3 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam ☆144 Updated this week
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand ☆56 Updated 2 years ago
- Apache Spark 3 - Structured Streaming Course Material ☆124 Updated 2 years ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process ☆47 Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam ☆507 Updated last month
- ☆137 Updated 8 months ago
- Data Engineering with Google Cloud Platform, published by Packt ☆119 Updated 2 years ago
- Ravi Azure ADB ADF Repository ☆64 Updated 9 months ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP! ☆12 Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆181 Updated 3 years ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs. ☆118 Updated last year
- Code snippets for Data Engineering Design Patterns book ☆249 Updated 7 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO ☆64 Updated 2 years ago
- ☆56 Updated last year
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster ☆479 Updated last year
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆38 Updated 4 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR ☆88 Updated 6 years ago
- Companion repository for the book 'Delta Lake Up and Running' ☆47 Updated 6 months ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs. ☆202 Updated last year
- ☆90 Updated 8 months ago
- Delta-Lake, ETL, Spark, Airflow ☆48 Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆157 Updated 5 years ago
- ☆29 Updated last year
- Master Big Data With PySpark and AWS ☆131 Updated 2 years ago
- ETL pipeline using pyspark (Spark - Python) ☆116 Updated 5 years ago
- Unit testing using databricks connect ☆32 Updated 3 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆279 Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆172 Updated last month