sudarshan-koirala / 30-days-of-DatabricksLinks
30 days of Databricks is a step-by-step guide to learn Databricks in 30 days for complete beginners.
☆18Updated last year
Alternatives and similar repositories for 30-days-of-Databricks
Users that are interested in 30-days-of-Databricks are comparing it to the libraries listed below
Sorting:
- ☆31Updated 11 months ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆12Updated last year
- ☆21Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆41Updated last year
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆11Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆21Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆78Updated last year
- (WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications☆129Updated last year
- YouTube tutorial project☆104Updated last year
- This project aims to leverage big data technologies to help OTT platforms predict churn at a customer level in real-time. We use Amazon S…☆5Updated 3 years ago
- An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az…☆23Updated last year
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆32Updated last year
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆38Updated last year
- Public data and analytics for our open course☆32Updated last year
- ☆20Updated 3 months ago
- Data Engineering with Google Cloud Platform, published by Packt☆117Updated last year
- This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as ou…☆12Updated last year
- Resources to learn analytics engineering☆66Updated 3 months ago
- Sample repo for startdataengineering DE 101 free course☆64Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆30Updated 2 years ago
- ☆94Updated 2 years ago
- Analysis of the Premier League games and seasons since 1992.☆26Updated 2 years ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆36Updated last year
- Analysis of SQL Leetcode and classic interview questions. Common pitfalls, anti-patterns and handy tricks are discussed. Sample databases…☆46Updated 3 years ago
- Materials for the AI Dev 2024 conference workshop "Deploy and Monitor ML Pipelines with Python, Open Source, and Free Applications"☆94Updated this week
- tokyo-olympic-azure-data-engineering-project☆209Updated 11 months ago
- This is a code repository for the course Data Engineering with Data Build Tool (DBT).☆58Updated 9 months ago
- Databricks ML in Action, Published by Packt☆30Updated last month
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆99Updated 10 months ago
- My current data engineering portfolio. Includes projects spanning ETL, orchestration and dashboarding.☆113Updated last year