☆93Sep 14, 2022Updated 3 years ago
Alternatives and similar repositories for data-engineering-spark
Users that are interested in data-engineering-spark are comparing it to the libraries listed below
Sorting:
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- ☆15Aug 18, 2021Updated 4 years ago
- An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information☆29Apr 22, 2022Updated 3 years ago
- This repo contains commands that data engineers use in day to day work.☆61Feb 4, 2023Updated 3 years ago
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆15Aug 22, 2019Updated 6 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆490Oct 15, 2024Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Apr 16, 2024Updated last year
- Unit testing using databricks connect☆32Nov 3, 2021Updated 4 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆26May 27, 2021Updated 4 years ago
- Data Engineering on GCP☆41Oct 20, 2022Updated 3 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated last month
- ☆20Aug 20, 2019Updated 6 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- ☆13Dec 30, 2022Updated 3 years ago
- Data for the `Data Analysis with Python and PySpark` book☆41Jan 9, 2023Updated 3 years ago
- Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )☆93Jun 16, 2018Updated 7 years ago
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 4 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Sep 4, 2015Updated 10 years ago
- Content related to Mastering Postgresql along with videos.☆19Aug 18, 2021Updated 4 years ago
- ☆24Aug 8, 2021Updated 4 years ago
- Events about the open source data stack☆13Apr 16, 2022Updated 3 years ago
- 🎲 Repositório para armazenar todos os componentes referentes a Data Science / Data Engineering do projeto☆18Oct 27, 2023Updated 2 years ago
- Databricks Platform - Architecture, Security, Automation and much more!!☆55Updated this week
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Jul 23, 2023Updated 2 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆12Jul 9, 2024Updated last year
- Delta-Lake, ETL, Spark, Airflow☆48Oct 9, 2022Updated 3 years ago
- Big data projects implemented by Maniram yadav☆50May 5, 2018Updated 7 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆272Mar 1, 2026Updated 3 weeks ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Apr 12, 2023Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Dec 4, 2023Updated 2 years ago
- Apache Spark for data engineers☆58Jul 28, 2022Updated 3 years ago
- Ravi Azure ADB ADF Repository☆65Jan 25, 2025Updated last year
- ☆14Sep 14, 2021Updated 4 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- ☆55Nov 13, 2020Updated 5 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 3, 2024Updated 2 years ago
- Code base for airflow training series Getting easy with Apache Airflow☆42Aug 13, 2023Updated 2 years ago