☆22Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for apache-iceberg-data-exploration
Users that are interested in apache-iceberg-data-exploration are comparing it to the libraries listed below.
- ☆14Oct 10, 2025Updated 5 months ago
- python library for iceberg lake house on your local☆14Jan 8, 2026Updated 2 months ago
- A minimal docker compose setup for experimenting with cloud agnostic Lakehouse Architectures Apache Spark with Hive Metastore + Delta Lak…☆34Apr 17, 2024Updated last year
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them☆25Mar 3, 2024Updated 2 years ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…☆14Sep 30, 2024Updated last year
- all-in-one-docker-bigdataops is a comprehensive Docker Compose environment that simplifies Big Data operations by bundling Hadoop, Spark,…☆21Feb 9, 2025Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆75Sep 2, 2023Updated 2 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- Repo for learning DBT with Snowflake, featuring projects and models for data transformation and automation☆25Mar 31, 2025Updated 11 months ago
- Build a data pipeline with Apache Airflow☆11May 7, 2021Updated 4 years ago
- Capture the logical plan from Spark (SQL)☆22Mar 6, 2021Updated 5 years ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Jul 31, 2022Updated 3 years ago
- Tools for Microsoft Fabric☆25Jul 17, 2025Updated 8 months ago
- ☆10Jul 21, 2022Updated 3 years ago
- ☆41Jul 4, 2022Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Mar 9, 2026Updated 2 weeks ago
- ☆16May 29, 2023Updated 2 years ago
- ☆11Mar 7, 2021Updated 5 years ago
- Awesome Data Engineering☆20Feb 3, 2025Updated last year
- Serverless costs calculator for AWS Lambda☆12Oct 21, 2020Updated 5 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Dec 24, 2024Updated last year
- On-premises ELT Pipeline☆31Jul 10, 2025Updated 8 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆50Dec 2, 2023Updated 2 years ago
- A MkDocs plugin to add bootstrap classes to plain markdown generated tables.☆13Mar 27, 2020Updated 5 years ago
- Data Engineering Projects using Mage.ai as orchestrator☆18Jan 20, 2026Updated 2 months ago
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- Hands-on examples to integrate GX data validation in a data pipeline.☆18Mar 16, 2026Updated last week
- ☆12Aug 13, 2024Updated last year
- ELT Data Pipeline implementation in Data Warehousing environment☆30May 2, 2025Updated 10 months ago
- Delta-Lake, ETL, Spark, Airflow☆48Oct 9, 2022Updated 3 years ago
- Terraform module for deploying the Prefect Agent on AWS EC2☆13Aug 20, 2025Updated 7 months ago
- Generates a tree of an S3 bucket contents☆10Sep 18, 2020Updated 5 years ago
- Visual Studio Code Server on Azure Web App for Containers☆10Apr 12, 2019Updated 6 years ago
- git push your data stack with Airbyte, Airflow, and dbt - 2022 Airflow Summit☆53May 12, 2023Updated 2 years ago
- Flink Example☆17Nov 19, 2023Updated 2 years ago
- Pipeline, warehouse, and visualization tools for investigating the impact of Airbnb short-term rentals on world cities.☆14Jun 9, 2023Updated 2 years ago
- Examples from Rob's Awesome Python Template☆15Mar 16, 2026Updated last week
- Materials for EDP workshops.☆26Dec 10, 2025Updated 3 months ago