In this project, we set up an end-to-end data engineering pipeline using Apache Spark, Azure Databricks, and Data Build Tool (dbt), with Azure as our cloud provider.
☆39 · Dec 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for modern-data-eng-dbt-databricks-azure
Users that are interested in modern-data-eng-dbt-databricks-azure are comparing it to the libraries listed below.
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆25 · Jan 26, 2024 · Updated 2 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de… ☆12 · Nov 18, 2023 · Updated 2 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste… ☆11 · Oct 11, 2023 · Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆48 · Dec 11, 2023 · Updated 2 years ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka ☆42 · May 17, 2024 · Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆331 · Feb 14, 2025 · Updated last year
- A Python wrapper for the Iterable API ☆12 · Jan 7, 2026 · Updated 5 months ago
- The best calendar table you'll ever find! Generate a calendar table with many columns of date dimensions and metadata. Output to datafra… ☆12 · Mar 27, 2026 · Updated 2 months ago
- ms-dataverse is a Python module for Microsoft Dataverse, offering a lightweight ORM to query, create, update, and delete entities. Utiliz… ☆13 · Apr 10, 2023 · Updated 3 years ago
- Code to demonstrate data engineering metadata & logging best practices ☆21 · Mar 12, 2024 · Updated 2 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh… ☆217 · Oct 23, 2023 · Updated 2 years ago
- This is an end to end MLOps system ☆34 · Nov 27, 2025 · Updated 6 months ago
- This repository showcases a collection of machine learning projects in various domains, demonstrating my skills and expertise as a data s… ☆12 · Nov 20, 2023 · Updated 2 years ago
- Project on NLP and model deployment ☆10 · Feb 1, 2023 · Updated 3 years ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th… ☆25 · Jun 11, 2026 · Updated last week
- ☆16 · Dec 23, 2023 · Updated 2 years ago
- ☆16 · Jan 13, 2021 · Updated 5 years ago
- Scraper and analyzer of shared scooter data ☆11 · Jul 30, 2024 · Updated last year
- Demoing how to use Matrix and Each definitions in Azure DevOps YAML pipelines. ☆19 · Apr 1, 2026 · Updated 2 months ago
- Demo of Machine Learning Prediction Model API with Django REST API Framework ☆10 · Dec 22, 2019 · Updated 6 years ago
- ☆14 · Apr 20, 2023 · Updated 3 years ago
- ☆13 · Oct 6, 2023 · Updated 2 years ago
- 📷 💾 Python bulk instagram scraper for photos and videos using Selenium and BS4. ☆17 · Jun 15, 2025 · Updated last year
- Superstore Sales with Streamlit is a data visualization and analysis project that uses the Streamlit framework to create an interactive w… ☆24 · Aug 24, 2023 · Updated 2 years ago
- Architectural design for incorporating a Data Lakehouse architecture with an Enterprise Power BI Deployment ☆19 · Apr 28, 2023 · Updated 3 years ago
- The Data Explorer and Machine Learning App ☆14 · Feb 22, 2026 · Updated 4 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme ☆29 · Apr 12, 2023 · Updated 3 years ago
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering ☆21 · Jul 12, 2025 · Updated 11 months ago
- 30 days of making a map a day, using Wherobots Cloud & SedonaDB ☆12 · Mar 1, 2024 · Updated 2 years ago
- dbt + Trino demo project, using TPC-H sample data ☆19 · Mar 27, 2024 · Updated 2 years ago
- A formatter for handling parameters and files uploaded to a Web API controller. ☆26 · Dec 7, 2022 · Updated 3 years ago
- ☆25 · Jul 9, 2023 · Updated 2 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering ☆12 · Dec 3, 2024 · Updated last year
- ☆11 · Apr 25, 2021 · Updated 5 years ago
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K… ☆75 · Mar 9, 2026 · Updated 3 months ago
- This is "Your Private StackOverflow" app that helps you perform generative search in your code bases. This is built using open-source sta… ☆11 · Aug 14, 2023 · Updated 2 years ago
- Exploring the effect of COVID-19 in air pollution by using satellite data, with the sentinelsat and cartopy libraries. ☆14 · Jun 3, 2020 · Updated 6 years ago
- The Power BI Cheat Sheet is a PDF crammed with Power BI best practices, tips and tricks based on years of experience! You can help too! D… ☆24 · Nov 5, 2019 · Updated 6 years ago
- Another Express.js MVC framework using MongoDB (NoSQL) and the MVC design pattern to make RESTful API calls; there is also JWT to protect… ☆16 · Nov 17, 2022 · Updated 3 years ago