Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
☆47Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for modern-data-lake-storage-layers
Users that are interested in modern-data-lake-storage-layers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- This repository provides the resources required for the Amazon Redshift Streaming workshop☆13Apr 13, 2026Updated 3 weeks ago
- ☆20Jan 19, 2024Updated 2 years ago
- 🌳 A sustainable Terraform Package which creates resources for Data Services on AWS☆14Feb 25, 2026Updated 2 months ago
- ☆18Jun 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🌉 Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena☆30Jul 25, 2022Updated 3 years ago
- ☆32Jan 30, 2026Updated 3 months ago
- Modernize seu Data Warehouse☆15Nov 12, 2024Updated last year
- A Caddy server module that provides a REST API for DuckDB database operations with built-in authentication and authorization.☆81Mar 12, 2026Updated last month
- Sample datasets and code for operationalizing Amazon Fraud Detector using SageMaker DataWrangler, Feature Store, and Pipelines.☆18Dec 1, 2022Updated 3 years ago
- ☆39Jun 1, 2022Updated 3 years ago
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆17Dec 6, 2021Updated 4 years ago
- ☆18Apr 14, 2023Updated 3 years ago
- Template for a modular, Python-based data science project.☆41Apr 9, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc…☆12Aug 20, 2024Updated last year
- Set of Terraform scripts to spin up virtual lab infra for Cisco Cloud onRamp (CoR) for Multicloud☆15Oct 25, 2023Updated 2 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- this repogitory describe how to use avro-tools☆12Feb 21, 2018Updated 8 years ago
- A BigQuery adapter for Harlequin, a SQL IDE for the terminal.☆10Jan 19, 2025Updated last year
- Example code for running Spark and Hive jobs on EMR Serverless.☆169Apr 30, 2026Updated last week
- ☆15Apr 4, 2021Updated 5 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- dbt / Amazon Redshift Demonstration Project☆34Jan 3, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The source code for the book Modern Data Engineering with Apache Spark☆40Jul 26, 2022Updated 3 years ago
- A Kivy tutorial for PyOhio 2013☆14Apr 30, 2014Updated 12 years ago
- ☆11Oct 13, 2025Updated 6 months ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Jul 31, 2022Updated 3 years ago
- GeoNode is an open source platform that facilitates the creation, sharing, and collaborative use of geospatial data.☆17Sep 13, 2019Updated 6 years ago
- ☆12Aug 17, 2023Updated 2 years ago
- "유닉스 리눅스 셸 스크립트 예제 사전: Unix & Linux Shell Script Exercise Dictionary" - 한빛미디어☆10Jan 17, 2017Updated 9 years ago
- Last-seen sketch implementation in Go☆16Dec 15, 2020Updated 5 years ago
- This repo demonstrates how to use AWS application auto-scaling to implement custom-scaling in your Kinesis Data Analytics for Apache Flin…☆19Feb 21, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- These scripts clean the unused EBS volumes, AMIs and snapshots on Amazon Web Services.☆11Jul 24, 2015Updated 10 years ago
- The classic desktop version of osDQ☆10Jun 30, 2022Updated 3 years ago
- ☆74Jun 26, 2024Updated last year
- A svelte + neutralino template☆13Aug 5, 2024Updated last year
- Simple log parsing example in Python☆14Oct 7, 2015Updated 10 years ago
- The proposed solution shows and approach to unify and centralize logs across different compute platforms like EC2, ECS, EKS and Lambda wi…☆14Oct 17, 2023Updated 2 years ago
- ☆25Oct 12, 2023Updated 2 years ago