Companion repository for the book 'Delta Lake Up and Running'
☆49Apr 5, 2025Updated last year
Alternatives and similar repositories for delta-lake-up-and-running
Users that are interested in delta-lake-up-and-running are comparing it to the libraries listed below.
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Deployed a Kafka instance on an AWS EC2 instance to stream data into Databricks☆10Aug 15, 2023Updated 2 years ago
- This repository contains the database migration assistant Jupyter Notebook to be used while planning migration to Cosmos DB API for Mongo…☆14Sep 26, 2024Updated last year
- This project represents a whole process of Anime data collection, preparation, and delivery as a data app, powered by technologies like P…☆10Oct 4, 2022Updated 3 years ago
- ☆16Apr 1, 2025Updated last year
- A small example setting Python's logging configuration using a module invoked from a notebook.☆10May 14, 2023Updated 2 years ago
- Kafka Connect SMT to expand JSON field☆20Jan 26, 2026Updated 2 months ago
- A home automation sample that uses Semantic Kernel and Hue lights☆23Mar 14, 2024Updated 2 years ago
- ☆18Dec 2, 2024Updated last year
- The goal of this project is to build an ETL pipeline. The data would be processed as a batch (monthly) between 2018-01 and 2021-02.☆14Mar 26, 2022Updated 4 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 4 years ago
- A collection of useful and awesome Databricks resources☆19Dec 21, 2023Updated 2 years ago
- ☆17Apr 2, 2024Updated 2 years ago
- Anonymize faces in video stream☆10Oct 24, 2022Updated 3 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 3 years ago
- Review System☆11Nov 1, 2019Updated 6 years ago
- Vagrant configuration for VIVO☆37Feb 2, 2021Updated 5 years ago
- ☆64Jan 9, 2024Updated 2 years ago
- ☆26Nov 22, 2022Updated 3 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Collection of dockerized ETL jobs managed by data engineering.☆22Updated this week
- Easily Deploy Code to AWS Lambda☆13Aug 15, 2018Updated 7 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- GDS Patient Journey Demo☆13Jul 25, 2023Updated 2 years ago
- ⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio.☆24Oct 19, 2023Updated 2 years ago
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt☆46May 16, 2024Updated last year
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆61Jul 2, 2018Updated 7 years ago
- ☆19Jun 22, 2022Updated 3 years ago
- A sample for implementing retrieval augmented generation using Azure OpenAI to generate embeddings, Azure Cosmos DB for MongoDB vCore to…☆38Nov 24, 2025Updated 4 months ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago
- ☆11Oct 6, 2023Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Swift way to explore OO and functional design patterns☆39Jun 27, 2019Updated 6 years ago
- Run Ollama on Kubernetes☆46Nov 19, 2024Updated last year
- Birds 400-Species Image Classification using PyTorch Metric Learning (Triplet Margin Loss)☆13Nov 1, 2022Updated 3 years ago
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago