☆95Sep 14, 2022Updated 3 years ago
Alternatives and similar repositories for data-engineering-spark
Users that are interested in data-engineering-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- ☆28Jun 14, 2022Updated 3 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- ☆15Jul 31, 2022Updated 3 years ago
- ☆15Aug 18, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information☆30Apr 22, 2022Updated 4 years ago
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆15Aug 22, 2019Updated 6 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆492Oct 15, 2024Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Apr 16, 2024Updated 2 years ago
- A shell script to automate the operations of sqoop☆11Mar 29, 2021Updated 5 years ago
- Unit testing using databricks connect☆32Nov 3, 2021Updated 4 years ago
- Data Engineering on GCP☆41Oct 20, 2022Updated 3 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆20Aug 20, 2019Updated 6 years ago
- ☆13Dec 30, 2022Updated 3 years ago
- Data for the `Data Analysis with Python and PySpark` book☆42Jan 9, 2023Updated 3 years ago
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 4 years ago
- Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )☆94Jun 16, 2018Updated 7 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Sep 4, 2015Updated 10 years ago
- Content related to Mastering Postgresql along with videos.☆20Aug 18, 2021Updated 4 years ago
- Repository containing example solutions for the Data Engineering Career Path Portfolio Projects☆18Sep 16, 2022Updated 3 years ago
- ☆24Aug 8, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Events about the open source data stack☆13Apr 16, 2022Updated 4 years ago
- 🎲 Repositório para armazenar todos os componentes referentes a Data Science / Data Engineering do projeto☆17Oct 27, 2023Updated 2 years ago
- Databricks Platform - Architecture, Security, Automation and much more!!☆56Apr 7, 2026Updated 3 weeks ago
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Jul 23, 2023Updated 2 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆12Jul 9, 2024Updated last year
- ☆22Feb 5, 2024Updated 2 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆274Mar 1, 2026Updated 2 months ago
- Big data projects implemented by Maniram yadav☆50May 5, 2018Updated 7 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Dec 4, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Apache Spark for data engineers☆58Jul 28, 2022Updated 3 years ago
- IBM Data Engineering Professional Certificate☆35May 10, 2025Updated 11 months ago
- Ravi Azure ADB ADF Repository☆65Jan 25, 2025Updated last year
- The perf collector will capture resource utilization for a database server and create a CSV file to be uploaded to the Azure SQL Database…☆11Dec 13, 2017Updated 8 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- ☆56Nov 13, 2020Updated 5 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago