paypay / DataEngineerChallenge
β21Updated 2 years ago
Related projects β
Alternatives and complementary repositories for DataEngineerChallenge
- Weekly Data Engineering Newsletterβ93Updated 4 months ago
- Various data stream/batch process demo with Apache Scala Spark πβ11Updated 4 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot moreβ19Updated 2 years ago
- An example PySpark project with pytestβ17Updated 7 years ago
- A Giter8 template for scioβ30Updated 3 weeks ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.β46Updated last year
- Magic to help Spark pipelines upgradeβ33Updated last month
- (project & tutorial) dag pipeline tests + ci/cd setupβ85Updated 3 years ago
- data engineering 100 days π€ 𧲠𦾠| #DEβ37Updated last year
- Flowchart for debugging Spark applicationsβ101Updated last month
- Data validation library for PySpark 3.0.0β34Updated 2 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0β25Updated 3 years ago
- β11Updated 2 years ago
- Airflow training for the crunch confβ105Updated 6 years ago
- A tutorial on Apache Spark Unit Testingβ37Updated 8 years ago
- Template for Spark Projectsβ101Updated 5 months ago
- β16Updated last year
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus onβ¦β25Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization 2 courseβ37Updated 11 months ago
- Skeleton project for Apache Airflow training participants to work on.β16Updated 4 years ago
- Because its never late to start taking notes and 'public' it...β60Updated 3 weeks ago
- The iterative broadcast join example code.β69Updated 7 years ago
- β74Updated 4 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizerβ25Updated 2 years ago
- Building Big Data Pipelines with Apache Beam, published by Packtβ83Updated last year
- An ETL framework in Scala for Data Engineersβ22Updated 2 years ago
- Filling in the Spark function gaps across APIsβ50Updated 3 years ago
- An Introduction to Scalaβ23Updated last year
- Real-world Spark pipelines examplesβ83Updated 6 years ago
- β30Updated 5 years ago