Companion repository for the book 'Delta Lake Up and Running'
☆48Apr 5, 2025Updated 11 months ago
Alternatives and similar repositories for delta-lake-up-and-running
Users that are interested in delta-lake-up-and-running are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Azure Cosmos DB's Graph API provides the graph data model and Gremlin. This tutorial shows how to get started with the Graph (Gremlin) AP…☆10May 19, 2022Updated 3 years ago
- Alternative graph implementations built on the Azure CosmosDB SQL API, Java, Spring Boot, and Spring Data☆11Sep 23, 2024Updated last year
- This repository contains the database migration assistant Jupyter Notebook to be used while planning migration to Cosmos DB API for Mongo…☆14Sep 26, 2024Updated last year
- ☆61Feb 1, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 6 years ago
- A small example setting Python's logging configuration using a module invoked from a notebook.☆10May 14, 2023Updated 2 years ago
- Analysis code for Schmaelzle, O'Donnell et al. (2017) PNAS☆10May 7, 2017Updated 8 years ago
- Implementing CI/CD Using Azure Pipelines, published by Packt☆13Nov 29, 2023Updated 2 years ago
- Kafka Connect SMT to expand JSON field☆20Jan 26, 2026Updated 2 months ago
- ☆18Dec 2, 2024Updated last year
- The goal of this project is to build an ETL pipeline. The data would be processed as a batch (monthly) between 2018-01 and 2021-02.☆14Mar 26, 2022Updated 4 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Using Selenium and Beautiful Soup to scrape marathon images☆10Feb 21, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 4 years ago
- Anonymize faces in video stream☆10Oct 24, 2022Updated 3 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 3 years ago
- ☆17Aug 31, 2023Updated 2 years ago
- Esse repositório contém todos os códigos que desenvolvi durante o curso Leitura e Manipulação de Dados em Python oferecido no meu canal n…☆13Aug 9, 2021Updated 4 years ago
- ☆64Jan 9, 2024Updated 2 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Collection of dockerized ETL jobs managed by data engineering.☆21Mar 23, 2026Updated last week
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- GDS Patient Journey Demo☆13Jul 25, 2023Updated 2 years ago
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Oct 8, 2022Updated 3 years ago
- Source code for 'Pro Power BI Desktop' by Adam Aspin☆13Mar 28, 2017Updated 9 years ago
- Jupyter notebooks for the ML course at UCI☆18Dec 1, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆61Jul 2, 2018Updated 7 years ago
- Udacity Data Engineering Nanodegree Projects☆11Sep 5, 2019Updated 6 years ago
- Modeling customer churn with Spark☆12Jan 24, 2019Updated 7 years ago
- Code for dbt tutorial☆173Sep 9, 2025Updated 6 months ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆19Feb 27, 2023Updated 3 years ago
- A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, Azure Cosmos DB for MongoDB vCore to…☆37Nov 24, 2025Updated 4 months ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago