developer-advocacy-dremio/definitive-guide-to-apache-iceberg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/developer-advocacy-dremio/definitive-guide-to-apache-iceberg)

developer-advocacy-dremio / definitive-guide-to-apache-iceberg

☆119

Alternatives and similar repositories for definitive-guide-to-apache-iceberg

Users that are interested in definitive-guide-to-apache-iceberg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

developer-advocacy-dremio / iceberg-modules-repo
View on GitHub
Repository for Reference for Apache Iceberg LinkedIN Learning Courses
☆17Jan 27, 2025Updated last year
josephmachado / iceberg-features
View on GitHub
☆14Dec 11, 2023Updated 2 years ago
bluishglc / serverless-datalake-example
View on GitHub
A serverless datalake project and framework based on AWS S3，Glue，Athena，MWAA and QuickSight. With a series of best practices, it guides y…
☆16Nov 22, 2022Updated 3 years ago
Fundamentals-of-Data-Observability / oreilly-fodo-source-code
View on GitHub
This repository contains the source code of the examples provided in the book "Fundamentals of Data Observability" edited by O'Reilly and…
☆10Aug 4, 2023Updated 2 years ago
manjunath5496 / Apache-Flink-Papers
View on GitHub
"Nature's economy shall be the base for our own, for it is immutable, but ours is secondary. An economist without knowledge of nature is …
☆20May 31, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
0xcaff / duckdb_protobuf
View on GitHub
a duckdb extension for querying encoded protobuf messages
☆30Jul 21, 2025Updated last year
developer-advocacy-dremio / iceberg-intro-lessons
View on GitHub
Repo for Introduction to Iceberg Video
☆22Jun 3, 2024Updated 2 years ago
lakekeeper / console
View on GitHub
A leightweight UI for Lakekeeper
☆19Updated this week
udacity / agentic-ai-c4-exercises-demos
View on GitHub
This repository contains code for exercises and demos for C4 in the Agentic AI ND.
☆17Jul 7, 2026Updated 3 weeks ago
trinodb / trino-the-definitive-guide
View on GitHub
Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)
☆242Oct 3, 2022Updated 3 years ago
ognis1205 / delta-hub
View on GitHub
A platform and cloud-based service for data sharing based on the Delta Sharing protocol.
☆21Jun 12, 2024Updated 2 years ago
PacktPublishing / Serverless-ETL-and-Analytics-with-AWS-Glue
View on GitHub
Serverless ETL and Analytics with AWS Glue, published by Packt
☆53Apr 22, 2026Updated 3 months ago
PacktPublishing / Driving-Data-Quality-with-Data-Contracts
View on GitHub
☆40Mar 2, 2026Updated 4 months ago
bartosz25 / data-generator-blogging-platform
View on GitHub
☆16Mar 2, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
edbullen / DockerSpark245
View on GitHub
Spark cluster in docker containers with sample training Jupyter notebooks
☆26Feb 24, 2023Updated 3 years ago
wirelessr / trino-iceberg-playground
View on GitHub
Query Iceberg in Trino, Nessie as Catalog, and use minio to replace AWS S3
☆27Aug 7, 2025Updated 11 months ago
PacktPublishing / Azure-Data-Factory-Cookbook-Second-Edition
View on GitHub
Azure Data Factory Cookbook_Second Edition, published by Packt
☆19Feb 29, 2024Updated 2 years ago
bartosz25 / data-engineering-design-patterns-book
View on GitHub
Code snippets for Data Engineering Design Patterns book
☆413Jun 13, 2026Updated last month
apache / polaris
View on GitHub
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆2,022Updated this week
eadgbear / spark-wasm-udf
View on GitHub
Using WASM to write UDFs in Apache Spark
☆12Jun 3, 2024Updated 2 years ago
Apress / definitive-guide-to-aws-application-integration
View on GitHub
Source Code for 'The Definitive Guide to AWS Application Integration' by Jyothi Prasad Buddha and Reshma Beesetty
☆14May 7, 2023Updated 3 years ago
EcZachly / microbatch-hourly-deduped-tutorial
View on GitHub
☆126Jul 24, 2025Updated last year
mydgd / snowflake-table-catalog
View on GitHub
Streamlit application to explore Snowflake Tables
☆51Oct 28, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cordon-thiago / spark-schema-merge
View on GitHub
Spark app to merge different schemas
☆23Dec 21, 2020Updated 5 years ago
PacktPublishing / Practical-DevOps-Second-Edition
View on GitHub
Practical DevOps Second Edition, published by Packt
☆13Jan 30, 2023Updated 3 years ago
projectnessie / nessie-demos
View on GitHub
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆32Updated this week
apache / iceberg-python
View on GitHub
PyIceberg
☆1,102Updated this week
JamesMcGuigan / elasticsearch-faiss-cosine-similarity-search
View on GitHub
Cosine Similary Search in ElasticSearch + FAISS GPU
☆12Mar 24, 2022Updated 4 years ago
databricks / docker-spark-iceberg
View on GitHub
☆383Feb 15, 2026Updated 5 months ago
developer-advocacy-dremio / quick-guides-from-dremio
View on GitHub
Quick Guides from Dremio on Several topics
☆89May 11, 2026Updated 2 months ago
aws-solutions-library-samples / guidance-for-using-google-tag-manager-for-server-side-website-analytics-on-aws
View on GitHub
This Guidance helps you implement server-side tagging to collect event data and perform data analysis in near real-time.
☆15Apr 13, 2026Updated 3 months ago
duckdb / duckdb_httpfs_wasm_experiment
View on GitHub
HTTPFS extension for DuckDB. Adds support for an HTTPFileSytem and S3FileSystem.
☆18Nov 4, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
newfront / spark-intro-to-ml
View on GitHub
A Gentle introduction to Machine Learning with Apache Spark
☆11Mar 2, 2026Updated 4 months ago
ssp-data / personal-swiss-finance-dw
View on GitHub
Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it
☆27Mar 24, 2026Updated 4 months ago
fpgmaas / stream-iot
View on GitHub
An end-to-end workflow for processing streaming data on Azure.
☆17Sep 20, 2024Updated last year
polyzos / stream-processing-with-apache-flink
View on GitHub
☆67Aug 6, 2024Updated last year
projectnessie / iceberg-catalog-migrator
View on GitHub
CLI tool to bulk migrate the tables from one catalog another without a data copy
☆85Apr 12, 2025Updated last year
projectnessie / nessie
View on GitHub
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,484Updated this week
TrainingByPackt / Machine-Learning-with-AWS
View on GitHub
Learn how you can use the power of cloud services for your own machine learning and artificial intelligence projects
☆13Oct 31, 2018Updated 7 years ago