☆93Sep 14, 2022Updated 3 years ago
Alternatives and similar repositories for data-engineering-spark
Users that are interested in data-engineering-spark are comparing it to the libraries listed below
Sorting:
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- ☆15Aug 18, 2021Updated 4 years ago
- ☆15Jul 31, 2022Updated 3 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆26May 27, 2021Updated 4 years ago
- ☆27Jun 14, 2022Updated 3 years ago
- Events about the open source data stack☆13Apr 16, 2022Updated 3 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Sep 4, 2015Updated 10 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆488Oct 15, 2024Updated last year
- Notebooks for the ML Link Prediction Course☆14Nov 5, 2020Updated 5 years ago
- Unit testing using databricks connect☆32Nov 3, 2021Updated 4 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Apr 16, 2024Updated last year
- This repo contains commands that data engineers use in day to day work.☆61Feb 4, 2023Updated 3 years ago
- ☆13Feb 15, 2023Updated 3 years ago
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 3 years ago
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆15Aug 22, 2019Updated 6 years ago
- Big data projects implemented by Maniram yadav☆50May 5, 2018Updated 7 years ago
- Data for the `Data Analysis with Python and PySpark` book☆41Jan 9, 2023Updated 3 years ago
- Content related to Mastering Postgresql along with videos.☆18Aug 18, 2021Updated 4 years ago
- Data Engineering on GCP☆41Oct 20, 2022Updated 3 years ago
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Jul 23, 2023Updated 2 years ago
- AWS Certified Solutions Architect Professional SAP-C01 New Feb 2019 Version Exam Notes☆17Apr 6, 2019Updated 6 years ago
- ☆18Jun 16, 2024Updated last year
- Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )☆93Jun 16, 2018Updated 7 years ago
- ☆118Sep 21, 2020Updated 5 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆46Jul 15, 2022Updated 3 years ago
- Databricks Platform - Architecture, Security, Automation and much more!!☆54Updated this week
- Companion repository that goes along with Snowflake's "Introduction to Modern Data Engineering with Snowflake" course on Coursera☆136Feb 25, 2025Updated last year
- ☆24Aug 8, 2021Updated 4 years ago
- Overview of use cases and applications for software-defined radios (SDR)☆28Jan 28, 2023Updated 3 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆27Apr 12, 2023Updated 2 years ago
- Code repository for the "PySpark in Action" book☆214Jun 11, 2025Updated 8 months ago
- Git Repository☆153Jan 9, 2026Updated last month
- ☆28Jan 2, 2023Updated 3 years ago
- DevOps pipeline for Real Time Social/Web Mining☆26Feb 22, 2026Updated last week
- Projects done in the Data Engineering Nanodegree by Udacity.com☆272Aug 7, 2019Updated 6 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 3, 2024Updated last year
- ☆30Feb 25, 2025Updated last year
- Spark cluster in docker containers with sample training Jupyter notebooks☆27Feb 24, 2023Updated 3 years ago