findmypast / recruitment-test-data-engineering
Code test for data engineering candidates
β47Updated 10 months ago
Alternatives and similar repositories for recruitment-test-data-engineering:
Users that are interested in recruitment-test-data-engineering are comparing it to the libraries listed below
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflowβ137Updated 4 years ago
- πComplete End to End ETL Pipeline with Spark, Airflow, & AWSβ43Updated 5 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testingβ256Updated 7 months ago
- Simple stream processing pipelineβ98Updated 8 months ago
- Sample project to demonstrate data engineering best practicesβ179Updated 11 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.β79Updated 6 months ago
- β87Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboaβ¦β218Updated 2 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMRβ80Updated 5 years ago
- β119Updated last week
- Data Engineering examples for Airflow, Prefect, and Mage.ai; dbt for BigQuery, Redshift, ClickHouse, PostgreSQL; Spark/PySpark for Batch β¦β63Updated this week
- β37Updated last year
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.β309Updated 3 years ago
- Code for dbt tutorialβ151Updated 8 months ago
- PySpark functions and utilities with examples. Assists ETL process of data modelingβ102Updated 4 years ago
- Classwork projects and home works done through Udacity data engineering nano degreeβ74Updated last year
- β135Updated 2 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset whβ¦β12Updated 2 years ago
- Data pipeline that scrapes Rust cheater Steam profilesβ52Updated 3 years ago
- β149Updated 2 years ago
- Hey this is the repo that has all the queries and data for my video game training series!β142Updated 2 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and moreβ322Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviewsβ106Updated 9 months ago
- Template for Data Engineering and Data Pipeline projectsβ106Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.comβ107Updated 2 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.β140Updated 4 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWSβ172Updated 3 years ago
- β32Updated last year
- This repo contains commands that data engineers use in day to day work.β60Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.β173Updated 3 years ago