apache-spark-with-databricks-for-data-engineering
☆100 Jul 3, 2024 Updated last year
Alternatives and similar repositories for apache-spark-with-data-bricks-for-data-engineering
Users that are interested in apache-spark-with-data-bricks-for-data-engineering are comparing it to the libraries listed below.
- data-warehouse-snowflake-for-data-engineering ☆19 Sep 14, 2023 Updated 2 years ago
- This is a comprehensive end-to-end data engineering project. I extracted data directly from YouTube in raw JSON format using Python and A… ☆11 Jun 4, 2024 Updated last year
- Azure Databricks workshops with content on connectivity to Azure services, data engineering workflows and data sciences notebooks. ☆11 Feb 20, 2019 Updated 7 years ago
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite ☆15 Aug 22, 2019 Updated 6 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB. ☆43 Sep 26, 2023 Updated 2 years ago
- Udacity Data Engineering Nanodegree Projects ☆11 Sep 5, 2019 Updated 6 years ago
- The Christmas Project is a festive-themed data engineering initiative designed to integrate and analyze diverse datasets, creating a comp… ☆19 Jan 11, 2025 Updated last year
- ☆15 Dec 23, 2021 Updated 4 years ago
- ☆16 Apr 1, 2025 Updated last year
- ☆94 Dec 17, 2024 Updated last year
- This repository contains Data Science with python project datasets ☆17 Apr 12, 2019 Updated 7 years ago
- It's a Github Repo to get an understanding on various pre-processing steps required in Machine Learning before we build Machine Learning … ☆29 Jul 23, 2019 Updated 6 years ago
- ☆27 Aug 28, 2023 Updated 2 years ago
- Code samples for Ingest data with Microsoft Fabric notebooks ☆10 Jul 21, 2023 Updated 2 years ago
- ☆18 Nov 9, 2025 Updated 5 months ago
- An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az… ☆32 Oct 2, 2023 Updated 2 years ago
- Curated List for 2025 Full Time Job Openings by DataDrooler ☆25 Aug 25, 2025 Updated 8 months ago
- ☆15 Sep 28, 2023 Updated 2 years ago
- ☆19 Jun 22, 2022 Updated 3 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog ☆13 Aug 26, 2023 Updated 2 years ago
- Source Code for 'Azure Data Factory' by Example by Richard Swinbank ☆17 Jun 21, 2021 Updated 4 years ago
- ☆15 Aug 28, 2025 Updated 8 months ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development ☆22 Jul 9, 2019 Updated 6 years ago
- Awesome list for datapipeline ☆35 Feb 6, 2023 Updated 3 years ago
- Data Structures and Algorithms ☆21 Feb 23, 2026 Updated 2 months ago
- ☆13 Mar 7, 2025 Updated last year
- Personal blog post set up using jekyll ☆16 Apr 14, 2026 Updated 2 weeks ago
- ☆19 Updated this week
- This project provides valuable customer sentiment insights for Zomato by tracking and analyzing tweets related to their brand and service… ☆14 Aug 27, 2023 Updated 2 years ago
- Samples for fabric user data functions ☆27 Mar 16, 2026 Updated last month
- my solutions of the python exercises from w3resource.com website ☆17 Mar 7, 2021 Updated 5 years ago
- Udacity Data Engineer Nanodegree - Capstone project ☆11 Dec 19, 2019 Updated 6 years ago
- ☆28 Jul 9, 2025 Updated 9 months ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on. ☆24 Apr 27, 2023 Updated 3 years ago
- This repository is aimed to make Modern-Robot-Learning accessible to all. ☆27 Jan 16, 2026 Updated 3 months ago
- Mirror of the database of postal codes from GeoNames ☆20 Apr 12, 2024 Updated 2 years ago
- "Linux is a superbly polished copy of an antique - shinier than the original, perhaps, but still defined by it." ― Jaron Lanier ☆30 Nov 4, 2021 Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆29 Aug 18, 2024 Updated last year
- ☆215 Aug 13, 2023 Updated 2 years ago