This repo contains commands that data engineers use in day to day work.
☆61Feb 4, 2023Updated 3 years ago
Alternatives and similar repositories for TowardsDataEngineering
Users that are interested in TowardsDataEngineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆497Oct 15, 2024Updated last year
- All Data Engineering notebooks from Datacamp course☆116Dec 11, 2019Updated 6 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆32Feb 2, 2021Updated 5 years ago
- ☆95Sep 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data engineering interviews Q&A for data community by data community☆68Jun 7, 2020Updated 6 years ago
- ☆27Feb 2, 2018Updated 8 years ago
- Different ways to connect to storage in Azure Databricks☆11Jul 19, 2019Updated 6 years ago
- ☆18Nov 9, 2025Updated 7 months ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆92Jul 17, 2019Updated 6 years ago
- Case Study's from Danny Ma's Serious SQL Course☆19Aug 4, 2022Updated 3 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆364Oct 29, 2022Updated 3 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- This sample demonstrates how to make a use of modules provided by Microsoft Azure File Service in Python.☆11Apr 21, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Jan 22, 2024Updated 2 years ago
- Python ETL demo for Hackforge☆32Oct 11, 2023Updated 2 years ago
- Data Science Learning Notes☆11Oct 18, 2023Updated 2 years ago
- 🚑Android App for people of India during this Pandemic of Covid-19. During COVID-19, people living in societies and apartments may not be…☆16Jan 10, 2021Updated 5 years ago
- Personal Repository of Data Science Projects☆14May 8, 2019Updated 7 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Apr 14, 2023Updated 3 years ago
- ☆11Jan 10, 2026Updated 5 months ago
- Data set and queries that I use in my Hive and Impala presentations. Slides are usually posted at slideshare.net/markgrover☆20May 19, 2014Updated 12 years ago
- A tool to validate data, built around Apache Spark.☆102Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Nov 28, 2022Updated 3 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Emojify uses ML Models to create Emoji's corresponding to a given sentence.☆18Jan 6, 2021Updated 5 years ago
- various scripts☆20Dec 16, 2022Updated 3 years ago
- Odoo Modules Migration☆11Dec 9, 2025Updated 6 months ago
- FireBase Auth app built to authenticate user via firebase using Email Id , Phone Number , Google , Facebook , Yahoo , Twitter , Github …☆17Jul 29, 2020Updated 5 years ago
- ☆15Sep 14, 2021Updated 4 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 5 years ago
- Example end to end data engineering project.☆1,412Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Personal Data Engineering Projects☆1,017Feb 8, 2023Updated 3 years ago
- JupyterLab UI Testing Framework☆31Sep 2, 2021Updated 4 years ago
- ☆11Jul 13, 2020Updated 5 years ago
- Implement rest api service for manipulating blog contents using FastAPI in Python☆12Feb 14, 2023Updated 3 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆363Apr 24, 2024Updated 2 years ago
- This is the git repo for the tutorials I have done for Time Series Forecasting by Jason Brownlee☆11Dec 5, 2018Updated 7 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago