newfront / hitchhikers_guide_to_deltalake_streamingView external linksLinks
Don't Panic. This guide will help you when it feels like the end of the world.
☆30Feb 7, 2026Updated last week
Alternatives and similar repositories for hitchhikers_guide_to_deltalake_streaming
Users that are interested in hitchhikers_guide_to_deltalake_streaming are comparing it to the libraries listed below
Sorting:
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- End to end data pipeline☆22Apr 13, 2025Updated 10 months ago
- Code snippets used in demos recorded for the blog.☆37Jan 17, 2026Updated last month
- Code for my "Efficient Data Processing in SQL" book.☆60Aug 6, 2024Updated last year
- A Python Library to support running data quality rules while the spark job is running⚡☆198Updated this week
- Docker Compose environments for developing modern data platform architectures using Kafka, Flink, Spark, Iceberg, OpenLineage, OpenMetada…☆51Jan 30, 2026Updated 2 weeks ago
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 9 years ago
- Notebooks to learn Databricks Lakehouse Platform☆40Feb 5, 2026Updated last week
- A partially implemented ODBC driver for the Trino distributed SQL engine☆18Feb 2, 2026Updated 2 weeks ago
- Overview☆11Mar 26, 2021Updated 4 years ago
- Examples on how to make use of DestinE Data Lake services☆14Feb 10, 2026Updated last week
- Adaptive File Source Connector for Spark, optimised for reading from object stores☆15Oct 18, 2022Updated 3 years ago
- A Solara web app template for MapLibre☆16Jan 19, 2026Updated 3 weeks ago
- ☆10May 16, 2022Updated 3 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- ☆15Apr 1, 2025Updated 10 months ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- CKAN extension for data.gov.uk☆12Jan 27, 2026Updated 3 weeks ago
- Helper for handling PySpark DataFrame partition size 📑🎛️☆12Mar 8, 2024Updated last year
- Docker Image - Tadpole DB Hub☆14Jul 28, 2021Updated 4 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Utilities for cleaning, and processing data for carbonplan/offsets-db-web☆15Updated this week
- Documentation for ALA demo install☆10Jun 30, 2021Updated 4 years ago
- Using rio-tiler-mvt to create Mapbox satellite + Elevation 3d Vector tiles.☆13Jun 24, 2019Updated 6 years ago
- ☆10Aug 23, 2015Updated 10 years ago
- Python library allowing to manipulate data split into a collection of groups stored in Zarr format.☆13Jul 11, 2025Updated 7 months ago
- A Mermaid widget for interactively exploring Mermaid diagrams in notebooks and Panel data apps☆12Oct 25, 2024Updated last year
- OGC 2D Tile Matrix Set & TileSet Metadata standard☆12Jul 31, 2025Updated 6 months ago
- Basic Model Interface for Python☆12Jan 19, 2026Updated 3 weeks ago
- Java client for EventStore (http://geteventstore.com)☆20May 25, 2015Updated 10 years ago
- Field Boundaries for Agriculture (fiboa) - a specification that describes important properties of field boundaries☆15Aug 27, 2025Updated 5 months ago
- Geospatial visulation tools and templates☆12Oct 7, 2024Updated last year
- Using WASM to write UDFs in Apache Spark☆11Jun 3, 2024Updated last year
- Pangeo & OpenEO Joint tutorial for BiDS23 - "Scaling Big Data Analysis with Pangeo and OpenEO: Unlocking the Power of Space Data"☆11Feb 29, 2024Updated last year
- Akka Java cluster singleton example☆10Dec 5, 2023Updated 2 years ago
- ☆25Nov 16, 2025Updated 3 months ago
- ☆10May 2, 2025Updated 9 months ago
- Advanced parsing of structured data using Python's new match statement☆13Jan 15, 2025Updated last year