☆113Jan 15, 2025Updated last year
Alternatives and similar repositories for definitive-guide-to-apache-iceberg
Users that are interested in definitive-guide-to-apache-iceberg are comparing it to the libraries listed below.
- A serverless datalake project and framework based on AWS S3, Glue, Athena, MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- "Nature's economy shall be the base for our own, for it is immutable, but ours is secondary. An economist without knowledge of nature is …☆20May 31, 2021Updated 4 years ago
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆232Oct 3, 2022Updated 3 years ago
- The source code for the book Modern Data Engineering with Apache Spark☆39Jul 26, 2022Updated 3 years ago
- My collection of various config files for Linux/macOS☆12Dec 9, 2025Updated 3 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- Code snippets for Data Engineering Design Patterns book☆363Feb 16, 2026Updated last month
- Serverless ETL and Analytics with AWS Glue, published by Packt☆53Mar 2, 2026Updated 3 weeks ago
- AWS MWAA Quick Start With Terraform (Private Web Server).☆14May 2, 2021Updated 4 years ago
- Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool☆16Mar 5, 2025Updated last year
- RAG application (backend & frontend) with source retrieval and highlighting on the Databricks Platform☆17Apr 29, 2025Updated 11 months ago
- Using WASM to write UDFs in Apache Spark☆12Jun 3, 2024Updated last year
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,886Updated this week
- Practical Machine Learning on Databricks, published by Packt☆24Mar 2, 2026Updated 3 weeks ago
- Septima Search for Spatial Suite☆11Jan 24, 2026Updated 2 months ago
- This Guidance helps you implement server-side tagging to collect event data and perform data analysis in near real-time.☆15Feb 13, 2025Updated last year
- Apache Iceberg☆8,661Updated this week
- SpiceDB Client Generator for Java☆11Nov 21, 2025Updated 4 months ago
- Practical DevOps Second Edition, published by Packt☆13Jan 30, 2023Updated 3 years ago
- Quick Guides from Dremio on Several topics☆82Mar 19, 2026Updated last week
- Simple Social Agents powered by Claude Agent SDK☆22Feb 28, 2026Updated last month
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Mar 9, 2026Updated 2 weeks ago
- BSR's new public API. Currently in development.☆21Jan 26, 2026Updated 2 months ago
- ☆65Aug 6, 2024Updated last year
- Spark app to merge different schemas☆23Dec 21, 2020Updated 5 years ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- ☆12Feb 1, 2022Updated 4 years ago
- ☆10Jan 15, 2022Updated 4 years ago
- Building a Q&A app (powered by an LLM) using AWS Bedrock, AWS Kendra, AWS S3 and Streamlit in just a couple of hours☆17Dec 7, 2023Updated 2 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,442Updated this week
- Cosine Similarity Search in ElasticSearch + FAISS GPU☆12Mar 24, 2022Updated 4 years ago
- Unity Catalog AI Model Context Protocol Server☆16Mar 28, 2025Updated last year
- ☆190Updated this week
- An end-to-end workflow for processing streaming data on Azure.☆17Sep 20, 2024Updated last year
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆84Apr 12, 2025Updated 11 months ago
- Chess AI with Spring AI☆12Feb 25, 2025Updated last year
- Apache Iceberg - Go☆397Updated this week
- ☆36Mar 2, 2026Updated 3 weeks ago