Databricks. Incremental data processing, task orchestration, and production job monitoring.
☆45Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for Advanced-Data-Engineering-with-Databricks
Users that are interested in Advanced-Data-Engineering-with-Databricks are comparing it to the libraries listed below.
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆204Dec 12, 2025Updated 6 months ago
- Azure Databricks workshops with content on connectivity to Azure services, data engineering workflows and data sciences notebooks.☆11Feb 20, 2019Updated 7 years ago
- Capstone Project for DataExpert.io V4 Cohort☆13Jul 8, 2024Updated last year
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆146Aug 11, 2024Updated last year
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL☆11Jun 7, 2023Updated 3 years ago
- ☆10May 5, 2022Updated 4 years ago
- ☆17Dec 23, 2021Updated 4 years ago
- A data pipeline project built on Databricks and Azure to demonstrate the lifecycle of a cloud data project.☆18Jan 12, 2022Updated 4 years ago
- ☆18Apr 6, 2025Updated last year
- validate data stored in CSV, PRN, ODS or Excel files☆20May 10, 2026Updated last month
- Repo for the Advanced Python Skills course that I created (hosted in Udemy and Skillshare)☆15Nov 1, 2020Updated 5 years ago
- Optimizing Databricks Workload, published by Packt☆18Apr 22, 2026Updated last month
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 4 years ago
- What the Hack Challenge format of the Advanced Databricks Workshop☆17Mar 15, 2019Updated 7 years ago
- ☆11Jun 8, 2026Updated last week
- Web App and Random Forest - Loan Approval Prediction☆22Oct 19, 2022Updated 3 years ago
- apache-spark-with-databricks-for-data-engineering☆103Jul 3, 2024Updated last year
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆244Aug 11, 2024Updated last year
- ☆10Nov 2, 2023Updated 2 years ago
- ☆28Jun 14, 2022Updated 4 years ago
- Minimal celery example with local filesystem broker + backend☆14Mar 19, 2019Updated 7 years ago
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- A Python CLI application that demonstrates how you can access AWS services, such as Amazon S3 and Amazon Athena, using trusted identity p…☆13Mar 11, 2025Updated last year
- Streaming Generative AI Application on AWS☆14Jun 24, 2024Updated last year
- PySpark data-pipeline testing and CICD☆28Oct 28, 2020Updated 5 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆104Sep 26, 2025Updated 8 months ago
- PyQL 🐍 is a SQL-like query language to run on Python source code files instead of database files using the GitQL SDK.☆34Dec 22, 2024Updated last year
- Testing various methods of moving Arrow data between processes☆17Mar 29, 2023Updated 3 years ago
- ☆16Oct 18, 2023Updated 2 years ago
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 4 years ago
- ☆20Mar 13, 2025Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Apr 16, 2024Updated 2 years ago
- Real-time OLTP system for credit card fraud detection using AWS API Gateway, Kinesis, and RDS PostgreSQL. Features a scalable, serverless…☆25Dec 16, 2024Updated last year
- Sample application showcasing the use of Dapr to build microservices based apps☆15Feb 4, 2026Updated 4 months ago
- This repository contains solutions to the SQL challenges posted on DataLemur website using PostgreSQL environment☆23Oct 17, 2022Updated 3 years ago
- A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automatio…☆72Jun 12, 2026Updated last week
- Terragrunt friendly module to create AWS API Gateway (V1) w\Optional WAF, many stages/api keys/usage plans using the OpenAPI 3.x spec. 🇺…☆11Feb 13, 2026Updated 4 months ago
- ☆14Dec 24, 2025Updated 5 months ago