A curated list of awesome lakehouse frameworks, applications, etc.
☆42Feb 9, 2026Updated last month
Alternatives and similar repositories for awesome-lakehouse
Users that are interested in awesome-lakehouse are comparing it to the libraries listed below
- Monitoring and insights on your data lakehouse tables☆32Updated this week
- ☆11Nov 26, 2024Updated last year
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆268Feb 18, 2026Updated 2 weeks ago
- Point-in-Time optimizations for Apache Spark☆30Jan 18, 2024Updated 2 years ago
- Use pyarrow with Azure Data Lake gen2☆28Jun 27, 2024Updated last year
- FederatedCatalog☆12Updated this week
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆83Apr 12, 2025Updated 10 months ago
- Python Package to Share/Edit Pandas/Polars DF with web interface!☆11Jun 10, 2025Updated 8 months ago
- Hadoop/Hive/Spark container to perform CI tests☆10Dec 26, 2020Updated 5 years ago
- How to customize Tableau authentication using AWS Athena's JDBC Credentials Provider capabilities.☆14Jun 8, 2020Updated 5 years ago
- Facilitates collaboration and governance for all participants in a Data Space.☆13Feb 27, 2026Updated last week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆46Dec 14, 2025Updated 2 months ago
- Looker Access is a command line tool to control Looker roles, groups, permission sets and model sets.☆10Apr 20, 2019Updated 6 years ago
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆17Feb 5, 2026Updated last month
- Crossplane upjet provider for Confluent Cloud: https://registry.terraform.io/providers/confluentinc/confluent/latest/docs☆13Jan 20, 2026Updated last month
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,210Updated this week
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 10 months ago
- Demo application on how to create a serverless realtime analytics application using Kinesis Data Streams, Kinesis Firehose, DynamoDB and …☆14Dec 4, 2020Updated 5 years ago
- Proof Of Concept - Open Patient Pathway Generator using an agent-based approach☆11Apr 4, 2023Updated 2 years ago
- Infra stuff to run Kubernetes on travisci☆10Mar 7, 2023Updated 3 years ago
- Provides time series data and metadata as Apache Arrow.☆16Updated this week
- A benchmark tool for lakehouses.☆14Mar 12, 2023Updated 2 years ago
- Load testing for event analytics platforms (Snowplow, more coming soon)☆13May 17, 2016Updated 9 years ago
- Pager for tabular data and SQL output☆12Mar 29, 2023Updated 2 years ago
- A GitHub repository associated with paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"☆10Jun 22, 2020Updated 5 years ago
- Run an open-source data LakeHouse locally using Docker Compose☆12May 31, 2024Updated last year
- FMI for Power System☆10Sep 6, 2019Updated 6 years ago
- Package for adding business days to a reference date or checking whether a given date is a business day, and for retrieving a list…☆10Dec 8, 2025Updated 3 months ago
- A broker quota plugin for Apache Kafka® to allow setting per-broker limits statically in the broker configuration☆16Feb 3, 2026Updated last month
- Repository for the OAC (ODRL profile for Access Control) documentation: https://w3id.org/oac☆10Oct 20, 2024Updated last year
- Repository of the metadata specification mobilityDCAT-AP☆18Feb 18, 2026Updated 2 weeks ago
- Latest: 7.0.0 - Lightweight and ready-to-use services to easily connect an IDS-Connector to different IDS-Infrastructure-Components.☆14Mar 4, 2024Updated 2 years ago
- Policy Administration point to handle ODRL policies and provide their Rego-equivalent to the Open Policy Agent☆11Feb 23, 2026Updated 2 weeks ago
- the Sampa group website☆10Oct 1, 2021Updated 4 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- Atlassian Bamboo and Bitbucket images for GKE clusters☆10Mar 24, 2022Updated 3 years ago
- reclaim your stuff from social media silos☆47Jan 13, 2015Updated 11 years ago