☆63Jan 9, 2024Updated 2 years ago
Alternatives and similar repositories for Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse
Users that are interested in Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Spark 3 - Structured Streaming Course Material☆126Aug 19, 2023Updated 2 years ago
- Azure Data Factory Cookbook_Second Edition, published by Packt☆19Feb 29, 2024Updated 2 years ago
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course☆26Jan 5, 2023Updated 3 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- Unit testing using databricks connect☆32Nov 3, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Git Repo for EDW Best Practice Assets on the Lakehouse☆16Dec 11, 2023Updated 2 years ago
- Resources for the Udemy Course - Azure Data Factory For Data Engineers - Project on Covid19 by Ramesh Retnasamy☆274Feb 10, 2024Updated 2 years ago
- ☆87Mar 26, 2025Updated last year
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆34Nov 9, 2023Updated 2 years ago
- 🔧 Azure Data Engineering Project (On-premise db to the cloud)☆20Mar 30, 2024Updated 2 years ago
- This repository shows my personal notes taken while doing the Udacity Data engineering Nanodegree☆13May 28, 2020Updated 6 years ago
- Local SQL Database ---> Azure ---> Power BI☆15Oct 13, 2023Updated 2 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆44May 31, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Jun 3, 2023Updated 3 years ago
- Hands-on Serverless computing with Go [video], published by Packt☆14Oct 28, 2022Updated 3 years ago
- Data Guy Story commandline☆11Dec 2, 2022Updated 3 years ago
- Examples for the Testing In Python Book☆13Mar 26, 2026Updated 2 months ago
- ☆17Apr 1, 2025Updated last year
- ☆11Feb 7, 2021Updated 5 years ago
- A small example setting Python's logging configuration using a module invoked from a notebook.☆10May 14, 2023Updated 3 years ago
- Code for NeurIPS 2024 paper "A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models"☆16Oct 17, 2024Updated last year
- Stacked Pull Request Tool☆11Jul 8, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Apache Spark 3 - Spark Programming in Python for Beginners☆514Jul 25, 2024Updated last year
- ☆16Apr 8, 2023Updated 3 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- Companion repository for the book 'Delta Lake Up and Running'☆50Apr 5, 2025Updated last year
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 4 years ago
- Python - Complete Python, Django, Data Science and ML Guide, published by Packt☆16Dec 15, 2025Updated 5 months ago
- Azure Data Engineer Associate Certification Guide, published by Packt☆80Apr 22, 2026Updated last month
- Snowflake Data Engineering in Action☆41Oct 18, 2024Updated last year
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline.☆34Aug 27, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This dbt starter project template is using the Google Analytics 4 BigQuery exports as input for some practical examples / models to showc…☆41Feb 9, 2026Updated 4 months ago
- Projects and studies regarding Data Engineering Area☆22May 27, 2024Updated 2 years ago
- Apache Spark 3 - Structured Streaming Course Material☆46Sep 8, 2020Updated 5 years ago
- Entity Framework Interceptor to apply query and table hints to queries generated by Entity Framework☆12Feb 1, 2019Updated 7 years ago
- ☆14Feb 3, 2020Updated 6 years ago
- Includes Final Project (Python), Wireshark Labs, and Theoretical HWs☆13Sep 27, 2021Updated 4 years ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.☆23Oct 15, 2024Updated last year