☆22Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for apache-iceberg-data-exploration
Users that are interested in apache-iceberg-data-exploration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Oct 10, 2025Updated 6 months ago
- A minimal docker compose setup for experimenting with cloud agnostic Lakehouse Architectures Apache Spark with Hive Metastore + Delta Lak…☆34Apr 17, 2024Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 24, 2026Updated 3 weeks ago
- 🤖 An autonomous AI agent system that collaboratively designs, implements, and manages Apache Airflow DAGs through natural language inter…☆28Aug 6, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…☆15Sep 30, 2024Updated last year
- An end-to-end, containerized data pipeline for near-real-time user event analytics using Kafka, ClickHouse, Airflow, and PySpark. Made to…☆73Sep 12, 2025Updated 7 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆76Sep 2, 2023Updated 2 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- Build a data pipeline with Apache Airflow☆11May 7, 2021Updated 4 years ago
- Repo for learning DBT with Snowflake, featuring projects and models for data transformation and automation☆26Mar 31, 2025Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆83Mar 9, 2026Updated last month
- ☆10Jul 21, 2022Updated 3 years ago
- ☆41Jul 4, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆31Updated this week
- ☆16May 29, 2023Updated 2 years ago
- ☆11Mar 7, 2021Updated 5 years ago
- Code to convert static datasets into simulated data streams☆15Apr 6, 2023Updated 3 years ago
- Serverless costs calculator for AWS Lambda☆12Oct 21, 2020Updated 5 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Dec 24, 2024Updated last year
- A MkDocs plugin to add bootstrap classes to plan markdown generated tables.☆13Mar 27, 2020Updated 6 years ago
- Data Engineering Projects using Mage.ai as orchestrator☆18Jan 20, 2026Updated 2 months ago
- Presentation themes based on popular syntax themes☆18Oct 30, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Demo of orchestrating Airbyte connections with Prefect☆11Mar 3, 2022Updated 4 years ago
- Hands-on examples to integrate GX data validation in a data pipeline.☆18Mar 16, 2026Updated 3 weeks ago
- ☆11Jan 17, 2024Updated 2 years ago
- ☆12Aug 13, 2024Updated last year
- ELT Data Pipeline implementation in Data Warehousing environment☆30May 2, 2025Updated 11 months ago
- FastAPI ASGI with Django ORM and admin☆15May 15, 2022Updated 3 years ago
- Terraform module for deploying the Prefect Agent on AWS EC2☆13Aug 20, 2025Updated 7 months ago
- Fully automated csv to dashboard pipeline using Terraform, Google Cloud Storage, BigQuery, dbt, Prefect and Looker Studio. Peer ranked …☆45Nov 15, 2024Updated last year
- Generates a tree of an S3 bucket contents☆10Sep 18, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- git push your data stack with Airbyte, Airflow, and dbt - 2022 Airflow Summit☆53May 12, 2023Updated 2 years ago
- Creating a REST API with Python on Synapse Serverless pools using external tables☆12Dec 27, 2021Updated 4 years ago
- Automated ML pipeline with Python, Docker, Luigi, SciKit-Learn and Pandas to predict wine quality ratings☆18May 30, 2020Updated 5 years ago
- Pipeline, warehouse, and visualization tools for investigating the impact of Airbnb short-term rentals on world cities.☆14Jun 9, 2023Updated 2 years ago
- Examples from Rob's Awesome Python Template☆15Apr 6, 2026Updated last week
- Materials for EDP workshops.☆26Dec 10, 2025Updated 4 months ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆20Jul 31, 2023Updated 2 years ago