Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆30Mar 4, 2026Updated this week
Alternatives and similar repositories for nessie-demos
Users that are interested in nessie-demos are comparing it to the libraries listed below
Sorting:
- Fybrik platform - Arrow/Flight module☆15Aug 10, 2024Updated last year
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆18Jul 31, 2023Updated 2 years ago
- ☆10Jun 3, 2023Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆31Apr 13, 2023Updated 2 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- ☆13Oct 4, 2023Updated 2 years ago
- A platform to manage the data product life cycle☆22Feb 11, 2026Updated 3 weeks ago
- ☆14Dec 8, 2022Updated 3 years ago
- Transporter for integrating OpenLineage with OpenMetadata☆17Sep 10, 2025Updated 5 months ago
- Data Engineering Projects using Mage.ai as orchestrator☆19Jan 20, 2026Updated last month
- Mirror of Apache MetaModel Membrane☆16Jun 4, 2019Updated 6 years ago
- A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆46Dec 14, 2025Updated 2 months ago
- ☆22Feb 5, 2024Updated 2 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 5 months ago
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- ☆21Aug 26, 2025Updated 6 months ago
- An experiment in visualizing your Solr index via term counts, document counts, and memory usage per field and data type.☆15Feb 13, 2015Updated 11 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Apr 12, 2025Updated 10 months ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Feb 15, 2023Updated 3 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,430Updated this week
- An experiment to inject a customized parser using SparkSessionExtension☆16Jan 1, 2018Updated 8 years ago
- ☆22Jul 2, 2025Updated 8 months ago
- "Nature's economy shall be the base for our own, for it is immutable, but ours is secondary. An economist without knowledge of nature is …☆20May 31, 2021Updated 4 years ago
- Apache Hive Metastore as a Standalone server in Docker☆80Aug 22, 2024Updated last year
- ☆23May 2, 2024Updated last year
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- ☆97Mar 2, 2026Updated last week
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆120Mar 5, 2025Updated last year
- Repo which holds the materials for the EMR Zero To Hero☆27May 7, 2022Updated 3 years ago
- Istio is not just for Microservices: Secure your Kubernetes services using Istio Service Mesh☆19Aug 1, 2018Updated 7 years ago
- DuckDB DuckLake Demos☆38Jun 1, 2025Updated 9 months ago
- OpsCenter for Snowflake makes it easy to understand and manage your Snowflake consumption☆24May 15, 2024Updated last year
- XML for Analysis (XMLA) server based upon an olap4j connection☆23Dec 8, 2016Updated 9 years ago
- Full stack data engineering tools and infrastructure set-up☆57Feb 13, 2021Updated 5 years ago
- ☆376Updated this week