Wittline / Dropout-Students-Prediction
The goal of this project is to identify students at risk of dropping out the school
☆22Updated 3 years ago
Alternatives and similar repositories for Dropout-Students-Prediction:
Users that are interested in Dropout-Students-Prediction are comparing it to the libraries listed below
- Challenge Data Engineer☆25Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 7 months ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆100Updated 4 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆56Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆45Updated 5 years ago
- A guide to show you how to import data for ETL☆20Updated 2 years ago
- Template for Data Engineering and Data Pipeline projects☆109Updated 2 years ago
- ☆87Updated 2 years ago
- ☆13Updated 2 years ago
- Apache Spark for data engineers☆55Updated 2 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- All Data Engineering notebooks from Datacamp course☆115Updated 5 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated 10 months ago
- Mastering Tableau 2021 published by Packt☆34Updated 2 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆142Updated 4 years ago
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆120Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆45Updated 2 years ago
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆10Updated 4 years ago
- Some example projects for Data Engineers to build, end-to-end.☆28Updated last year
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆24Updated 2 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆14Updated 3 years ago
- ☆12Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- Analysis of SQL Leetcode and classic interview questions. Common pitfalls, anti-patterns and handy tricks are discussed. Sample databases…☆46Updated 3 years ago