☆113Jan 15, 2025Updated last year
Alternatives and similar repositories for definitive-guide-to-apache-iceberg
Users that are interested in definitive-guide-to-apache-iceberg are comparing it to the libraries listed below.
- Reference repository for Apache Iceberg LinkedIn Learning courses☆18Jan 27, 2025Updated last year
- ☆14Dec 11, 2023Updated 2 years ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- "Nature's economy shall be the base for our own, for it is immutable, but ours is secondary. An economist without knowledge of nature is …☆20May 31, 2021Updated 4 years ago
- This repository contains the source code of the examples provided in the book "Fundamentals of Data Observability" edited by O'Reilly and…☆10Aug 4, 2023Updated 2 years ago
- a duckdb extension for querying encoded protobuf messages☆30Jul 21, 2025Updated 9 months ago
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆234Oct 3, 2022Updated 3 years ago
- The source code for the book Modern Data Engineering with Apache Spark☆40Jul 26, 2022Updated 3 years ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- Code snippets for Data Engineering Design Patterns book☆380Feb 16, 2026Updated 2 months ago
- Serverless ETL and Analytics with AWS Glue, published by Packt☆53Apr 22, 2026Updated 2 weeks ago
- AWS MWAA Quick Start With Terraform (Private Web Server).☆14May 2, 2021Updated 5 years ago
- Query Iceberg in Trino, with Nessie as the catalog and MinIO in place of AWS S3☆27Aug 7, 2025Updated 9 months ago
- A template showing datalake pipelines using the serverless architecture☆11Apr 12, 2020Updated 6 years ago
- Spark cluster in docker containers with sample training Jupyter notebooks☆26Feb 24, 2023Updated 3 years ago
- IceDB S3 Proxy to trick S3 clients into only seeing alive files☆13Dec 24, 2023Updated 2 years ago
- Using WASM to write UDFs in Apache Spark☆12Jun 3, 2024Updated last year
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,927Updated this week
- Practical Machine Learning on Databricks, published by Packt☆23Mar 2, 2026Updated 2 months ago
- ☆17Jul 31, 2024Updated last year
- This Guidance helps you implement server-side tagging to collect event data and perform data analysis in near real-time.☆15Apr 13, 2026Updated 3 weeks ago
- Apache Iceberg☆8,817Updated this week
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated 2 months ago
- This repository is for the LinkedIn Learning course Creating an Open-Source Project in Python☆11Apr 3, 2023Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆31Updated this week
- ☆66Aug 6, 2024Updated last year
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- dbtVault + Greenplum demo☆11Feb 19, 2024Updated 2 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,453May 2, 2026Updated last week
- Source Code for 'The Definitive Guide to AWS Application Integration' by Jyothi Prasad Buddha and Reshma Beesetty☆14May 7, 2023Updated 3 years ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them☆25Mar 24, 2026Updated last month
- Cosine Similarity Search in ElasticSearch + FAISS GPU☆12Mar 24, 2022Updated 4 years ago
- Unity Catalog AI Model Context Protocol Server☆16Mar 28, 2025Updated last year
- An end-to-end workflow for processing streaming data on Azure.☆17Sep 20, 2024Updated last year
- ☆204Updated this week
- ☆37Mar 2, 2026Updated 2 months ago
- Apache Iceberg - Go☆406Updated this week
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆85Apr 12, 2025Updated last year
- Learn how you can use the power of cloud services for your own machine learning and artificial intelligence projects☆13Oct 31, 2018Updated 7 years ago