Anant / example-airflow-and-spark
☆12Updated 2 years ago
Related projects: ⓘ
- ☆84Updated 2 years ago
- End-to-end Kafka Streaming Examples on Databricks with Evolving Avro Schemas.☆9Updated 6 months ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆79Updated 5 years ago
- Apache Spark 3 - Structured Streaming Course Material☆120Updated last year
- ☆20Updated this week
- Ravi Azure ADB ADF Repository☆64Updated 4 months ago
- Data Engineering com Apache Spark☆43Updated 3 years ago
- ☆30Updated last year
- Docker with Airflow and Spark standalone cluster☆239Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS☆166Updated 2 years ago
- Guide for databricks spark certification☆57Updated 3 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆39Updated 5 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆73Updated 9 months ago
- Near real time ETL to populate a dashboard.☆69Updated 3 months ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆44Updated last year
- Delta-Lake, ETL, Spark, Airflow☆42Updated last year
- ETL pipeline using pyspark (Spark - Python)☆106Updated 4 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆39Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆127Updated 4 years ago
- Spark, Airflow, Kafka☆27Updated last year
- This repo contains commands that data engineers use in day to day work.☆58Updated last year
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆48Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆57Updated 4 months ago
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆111Updated 2 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 4 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆50Updated 2 years ago
- ☆39Updated this week
- Data Engineering on GCP☆29Updated last year
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆91Updated last month
- Unit testing using databricks connect☆29Updated 2 years ago