Companion repository for the book 'Delta Lake Up and Running'
☆49Apr 5, 2025Updated last year
Alternatives and similar repositories for delta-lake-up-and-running
Users that are interested in delta-lake-up-and-running are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Azure Cosmos DB's Graph API provides the graph data model and Gremlin. This tutorial shows how to get started with the Graph (Gremlin) AP…☆10May 19, 2022Updated 4 years ago
- Alternative graph implementations built on the Azure CosmosDB SQL API, Java, Spring Boot, and Spring Data☆11Sep 23, 2024Updated last year
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- ☆63Feb 1, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 6 years ago
- A small example setting Python's logging configuration using a module invoked from a notebook.☆10May 14, 2023Updated 3 years ago
- Analysis code for Schmaelzle, O'Donnell et al. (2017) PNAS☆10May 7, 2017Updated 9 years ago
- Implementing CI/CD Using Azure Pipelines, published by Packt☆13Nov 29, 2023Updated 2 years ago
- ☆18Dec 2, 2024Updated last year
- The goal of this project is to build an ETL pipeline. The data would be processed as a batch (monthly) between 2018-01 and 2021-02.☆14Mar 26, 2022Updated 4 years ago
- A home automation sample that uses Semantic Kernel and Hue lights☆24Mar 14, 2024Updated 2 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Python - Complete Python, Django, Data Science and ML Guide, published by Packt☆15Dec 15, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Anonymize faces in video stream☆10Oct 24, 2022Updated 3 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 3 years ago
- ☆17Aug 31, 2023Updated 2 years ago
- ☆64Jan 9, 2024Updated 2 years ago
- Magic to help Spark pipelines upgrade☆33Sep 29, 2024Updated last year
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- Azure Data Factory Cookbook_Second Edition, published by Packt☆19Feb 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A curated list of dagster code snippets for data engineers☆56Feb 26, 2024Updated 2 years ago
- CDF FAQ☆11Aug 16, 2022Updated 3 years ago
- ☆14Nov 22, 2024Updated last year
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- ⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio.☆25Oct 19, 2023Updated 2 years ago
- Source code for 'Pro Power BI Desktop' by Adam Aspin☆13Mar 28, 2017Updated 9 years ago
- Jupyter notebooks for the ML course at UCI☆18Dec 1, 2017Updated 8 years ago
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt☆46May 16, 2024Updated 2 years ago
- Udacity Data Engineering Nanodegree Projects☆11Sep 5, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for dbt tutorial☆178Sep 9, 2025Updated 8 months ago
- ☆21Dec 11, 2021Updated 4 years ago
- ☆19Jun 22, 2022Updated 3 years ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆19Feb 27, 2023Updated 3 years ago
- ☆11Oct 6, 2023Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 3 years ago
- MoVie revieW inspired by fastfetch☆46Jan 31, 2026Updated 3 months ago