☆23Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for apache-iceberg-data-exploration
Users that are interested in apache-iceberg-data-exploration are comparing it to the libraries listed below.
- ☆15Oct 10, 2025Updated 7 months ago
- python library for iceberg lake house on your local☆14Jan 8, 2026Updated 4 months ago
- A minimal docker compose setup for experimenting with cloud agnostic Lakehouse Architectures Apache Spark with Hive Metastore + Delta Lak…☆34Apr 17, 2024Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them☆25Mar 24, 2026Updated 2 months ago
- 🤖 An autonomous AI agent system that collaboratively designs, implements, and manages Apache Airflow DAGs through natural language inter…☆28Aug 6, 2025Updated 9 months ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆78Sep 2, 2023Updated 2 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆84Mar 9, 2026Updated 2 months ago
- Capture the logical plan from Spark (SQL)☆22Mar 6, 2021Updated 5 years ago
- Repo for learning DBT with Snowflake, featuring projects and models for data transformation and automation☆26Mar 31, 2025Updated last year
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Jul 31, 2022Updated 3 years ago
- Tools for Microsoft Fabric☆25Jul 17, 2025Updated 10 months ago
- ☆41Jul 4, 2022Updated 3 years ago
- pip installable duckdb extensions published to pypi☆43May 2, 2026Updated 3 weeks ago
- ☆16May 29, 2023Updated 2 years ago
- ☆11Mar 7, 2021Updated 5 years ago
- Serverless costs calculator for AWS Lambda☆12Oct 21, 2020Updated 5 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Dec 24, 2024Updated last year
- Case study describing Red Hat Marketing Operations use of Luigi on top of Openshift☆12Apr 17, 2017Updated 9 years ago
- A MkDocs plugin to add bootstrap classes to plain markdown generated tables.☆13Mar 27, 2020Updated 6 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆52Dec 2, 2023Updated 2 years ago
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- Demo of orchestrating Airbyte connections with Prefect☆11Mar 3, 2022Updated 4 years ago
- ☆12Aug 13, 2024Updated last year
- ELT Data Pipeline implementation in Data Warehousing environment☆30May 2, 2025Updated last year
- Generates a tree of an S3 bucket contents☆11Sep 18, 2020Updated 5 years ago
- Fully automated csv to dashboard pipeline using Terraform, Google Cloud Storage, BigQuery, dbt, Prefect and Looker Studio. Peer ranked …☆46Nov 15, 2024Updated last year
- Visual Studio Code Server on Azure Web App for Containers☆10Apr 12, 2019Updated 7 years ago
- git push your data stack with Airbyte, Airflow, and dbt - 2022 Airflow Summit☆53May 12, 2023Updated 3 years ago
- Creating a REST API with Python on Synapse Serverless pools using external tables☆12Dec 27, 2021Updated 4 years ago
- A sample for installing Kubernetes on bare-metal production servers (Ubuntu distro)☆10Jan 9, 2021Updated 5 years ago
- Automated ML pipeline with Python, Docker, Luigi, SciKit-Learn and Pandas to predict wine quality ratings☆18May 30, 2020Updated 5 years ago
- Flink Example☆17Nov 19, 2023Updated 2 years ago
- Python manager for spark-submit jobs☆10Jan 6, 2024Updated 2 years ago
- Pipeline, warehouse, and visualization tools for investigating the impact of Airbnb short-term rentals on world cities.☆14Jun 9, 2023Updated 2 years ago
- Online Simultaneous Localization and Mapping in ROS☆11Jan 31, 2019Updated 7 years ago
- Implement D*Lite and A* Algorithm on Processing environment☆11Apr 7, 2017Updated 9 years ago