dacort/modern-data-lake-storage-layers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dacort/modern-data-lake-storage-layers)

dacort / modern-data-lake-storage-layers

Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work

☆47

Alternatives and similar repositories for modern-data-lake-storage-layers

Users that are interested in modern-data-lake-storage-layers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / amazon-redshift-streaming-workshop
View on GitHub
This repository provides the resources required for the Amazon Redshift Streaming workshop
☆13Apr 13, 2026Updated 2 months ago
aws-samples / amazon-emr-with-delta-lake
View on GitHub
Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR
☆17Apr 27, 2025Updated last year
aws-samples / emr-studio-notebook-examples
View on GitHub
This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.
☆53Oct 31, 2023Updated 2 years ago
arezamoosavi / AcidOnSpark-ETL
View on GitHub
Delta-Lake, ETL, Spark, Airflow
☆50Oct 9, 2022Updated 3 years ago
scalatest / autofix
View on GitHub
Auto-fixing error due to version upgrade, good practice etc.
☆11Sep 5, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
aws-samples / emr-on-eks-hudi-iceberg-delta
View on GitHub
☆18Jun 16, 2024Updated 2 years ago
scalablescripts / node-search-mysql
View on GitHub
☆11Apr 27, 2021Updated 5 years ago
aws-samples / emr-on-eks-benchmark
View on GitHub
☆32Jul 2, 2026Updated last week
bmalnad / s3-pgp-encryptor
View on GitHub
AWS Lambda function - automatically PGP encrypts files added to S3 bucket
☆16May 3, 2022Updated 4 years ago
node-red / node-red-auth-github
View on GitHub
A GitHub authentication plugin for Node-RED
☆19Aug 14, 2021Updated 4 years ago
tobilg / caddy-duckdb-module
View on GitHub
A Caddy server module that provides a REST API for DuckDB database operations with built-in authentication and authorization.
☆81Mar 12, 2026Updated 3 months ago
nmukerje / EMR-Hudi-Workshop
View on GitHub
EMR Hudi Workshop content
☆12Dec 10, 2021Updated 4 years ago
wxhC3SC6OPm8M1HXboMy / spark-mrmr-feature-selection
View on GitHub
Machine learning enhancements to Spark MlLib
☆20Mar 19, 2015Updated 11 years ago
vvalcristina / Workshop-Data-Lakehouse
View on GitHub
Repositório dedicado a Workshop de Data Lakehouse com Delta Lake
☆17Dec 6, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aws-samples / aws-dms-msk-demo
View on GitHub
☆18Apr 14, 2023Updated 3 years ago
xfold / LanguageBiasesInReddit
View on GitHub
Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc…
☆12Aug 20, 2024Updated last year
CiscoDevNet / sdwan-cor-labinfra
View on GitHub
Set of Terraform scripts to spin up virtual lab infra for Cisco Cloud onRamp (CoR) for Multicloud
☆15Oct 25, 2023Updated 2 years ago
typeclasses / haskell-report-archive
View on GitHub
A collection of old versions of the Haskell Report
☆13Aug 17, 2017Updated 8 years ago
brefphp / costs-calculator
View on GitHub
Serverless costs calculator for AWS Lambda
☆12Oct 21, 2020Updated 5 years ago
aws-samples / browser-control-with-nova-act
View on GitHub
☆21Dec 3, 2025Updated 7 months ago
anna-anisienia / data-discovery-api
View on GitHub
☆15Apr 4, 2021Updated 5 years ago
masfworld / cdc_deltaLake
View on GitHub
Docker compose and Google Colab demo to build a CDC with Delta Lake
☆15Sep 7, 2022Updated 3 years ago
garystafford / dbt-redshift-demo
View on GitHub
dbt / Amazon Redshift Demonstration Project
☆34Jan 3, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
USEPA / EPANET
View on GitHub
EPANET: Graphical User Interface
☆12Mar 18, 2026Updated 3 months ago
ognis1205 / unitycatalog-explorer-legacy
View on GitHub
Unity Catalog Explorer is a TypeScript + Next.js based Web UI for the Unity Catalog OSS.
☆13Jun 29, 2024Updated 2 years ago
newfront / spark-moderndataengineering
View on GitHub
The source code for the book Modern Data Engineering with Apache Spark
☆41Jul 26, 2022Updated 3 years ago
luyomo / OhMyTiUP
View on GitHub
☆11Oct 13, 2025Updated 8 months ago
fog / fog-backblaze
View on GitHub
Integration library for gem fog and Backblaze B2 Cloud Storage
☆21Dec 14, 2020Updated 5 years ago
garystafford / emr-msk-serverless-demo
View on GitHub
Amazon EMR Serverless and Amazon MSK Serverless Demo
☆13Jul 31, 2022Updated 3 years ago
strykerin / Uniswap-dotnet
View on GitHub
A dotnet standard wrapper for the Uniswap V2 Subgraph on The Graph GraphQL API.
☆12Dec 17, 2020Updated 5 years ago
VividCortex / lastseen
View on GitHub
Last-seen sketch implementation in Go
☆16Dec 15, 2020Updated 5 years ago
shravanpn7 / AWS-Cleanup
View on GitHub
These scripts clean the unused EBS volumes, AMIs and snapshots on Amazon Web Services.
☆11Jul 24, 2015Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aws-samples / aws-samples-for-ray
View on GitHub
☆74Jun 26, 2024Updated 2 years ago
codehangar / python-log-parse-example
View on GitHub
Simple log parsing example in Python
☆14Oct 7, 2015Updated 10 years ago
aws-samples / unified-log-aggregation-and-analytics
View on GitHub
The proposed solution shows and approach to unify and centralize logs across different compute platforms like EC2, ECS, EKS and Lambda wi…
☆14Oct 17, 2023Updated 2 years ago
OrigamingWasTaken / neutralino-svelte
View on GitHub
A svelte + neutralino template
☆13Aug 5, 2024Updated last year
marcoil / gottengeography
View on GitHub
High quality and easy to use photo geotagging application for the GNOME desktop.
☆12Jul 13, 2012Updated 13 years ago
aws-samples / aws-dms-sql-server
View on GitHub
Amazon DMS infrastructure with sample SQL server databases.
☆14Nov 4, 2025Updated 8 months ago
RoberWare / pytwinkle
View on GitHub
Twinkle sip client, ported to a python module.
☆17May 31, 2024Updated 2 years ago