Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
☆47Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for modern-data-lake-storage-layers
Users who are interested in modern-data-lake-storage-layers are comparing it to the libraries listed below.
- A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Apr 27, 2025Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆69Sep 23, 2023Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆49Oct 9, 2022Updated 3 years ago
- Auto-fixing error due to version upgrade, good practice etc.☆11Sep 5, 2020Updated 5 years ago
- ☆16May 9, 2022Updated 4 years ago
- ☆18Jun 16, 2024Updated 2 years ago
- Examples and Quick Starts for Snowflake☆11Jun 9, 2026Updated last week
- ☆11Apr 27, 2021Updated 5 years ago
- 🌉 Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena☆30Jul 25, 2022Updated 3 years ago
- ☆32Jan 30, 2026Updated 4 months ago
- A Caddy server module that provides a REST API for DuckDB database operations with built-in authentication and authorization.☆81Mar 12, 2026Updated 3 months ago
- Hybrid Search (BM25 & Vector) with SQLite☆33Aug 13, 2024Updated last year
- Sample datasets and code for operationalizing Amazon Fraud Detector using SageMaker DataWrangler, Feature Store, and Pipelines.☆18Dec 1, 2022Updated 3 years ago
- EMR Hudi Workshop content☆12Dec 10, 2021Updated 4 years ago
- Repository dedicated to a Data Lakehouse workshop with Delta Lake☆17Dec 6, 2021Updated 4 years ago
- Template for a modular, Python-based data science project.☆41Apr 9, 2024Updated 2 years ago
- docs, codes and resources to prepare for the CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 with Python 3 certific…☆10Sep 25, 2019Updated 6 years ago
- Project from the FastAPI tutorial series in which we learn to build an API from scratch, step by step☆18Dec 5, 2021Updated 4 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- A collection of old versions of the Haskell Report☆13Aug 17, 2017Updated 8 years ago
- This repository describes how to use avro-tools☆12Feb 21, 2018Updated 8 years ago
- Serverless costs calculator for AWS Lambda☆12Oct 21, 2020Updated 5 years ago
- A BigQuery adapter for Harlequin, a SQL IDE for the terminal.☆11Jan 19, 2025Updated last year
- ☆21Dec 3, 2025Updated 6 months ago
- A FHIR implementation guide that supports conversion of data from FHIR to OMOP and OMOP to FHIR☆16Jun 11, 2026Updated last week
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- Converts DICOM Resources to FHIR Resources☆22Nov 18, 2020Updated 5 years ago
- Apache Spark 3 - Structured Streaming Course Material☆46Sep 8, 2020Updated 5 years ago
- GeoNode is an open source platform that facilitates the creation, sharing, and collaborative use of geospatial data.☆17Sep 13, 2019Updated 6 years ago
- ☆12Aug 17, 2023Updated 2 years ago
- These scripts clean the unused EBS volumes, AMIs and snapshots on Amazon Web Services.☆11Jul 24, 2015Updated 10 years ago
- The classic desktop version of osDQ☆10Jun 30, 2022Updated 3 years ago
- Simple log parsing example in Python☆14Oct 7, 2015Updated 10 years ago
- A svelte + neutralino template☆13Aug 5, 2024Updated last year
- ☆25Oct 12, 2023Updated 2 years ago
- ☆16Sep 4, 2023Updated 2 years ago
- High quality and easy to use photo geotagging application for the GNOME desktop.☆12Jul 13, 2012Updated 13 years ago
- Mirror of Apache Flink☆10May 12, 2022Updated 4 years ago