Don't Panic. This guide will help you when it feels like the end of the world.
☆30Feb 7, 2026Updated last month
Alternatives and similar repositories for hitchhikers_guide_to_deltalake_streaming
Users that are interested in hitchhikers_guide_to_deltalake_streaming are comparing it to the libraries listed below
Sorting:
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- ☆61Feb 1, 2025Updated last year
- The source code for the book Modern Data Engineering with Apache Spark☆39Jul 26, 2022Updated 3 years ago
- End to end data pipeline☆22Apr 13, 2025Updated 10 months ago
- The Internals of PySpark☆27Dec 29, 2024Updated last year
- A Python Library to support running data quality rules while the spark job is running⚡☆200Updated this week
- Docker Compose environments for developing modern data platform architectures using Kafka, Flink, Spark, Iceberg, OpenLineage, OpenMetada…☆51Jan 30, 2026Updated last month
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 10 years ago
- Notebooks to learn Databricks Lakehouse Platform☆41Feb 16, 2026Updated 3 weeks ago
- A partially implemented ODBC driver for the Trino distributed SQL engine☆18Feb 2, 2026Updated last month
- Examples on how to make use of DestinE Data Lake services☆14Feb 20, 2026Updated 2 weeks ago
- Overview☆11Mar 26, 2021Updated 4 years ago
- Free tool to copy CSVs from https://chartink.com/☆15Sep 7, 2025Updated 6 months ago
- ☆16Apr 1, 2025Updated 11 months ago
- ☆10May 16, 2022Updated 3 years ago
- Adaptive File Source Connector for Spark, optimised for reading from object stores☆15Oct 18, 2022Updated 3 years ago
- An R package that facilitates accessing the aWhere Ag Intel Platform using R☆12Nov 1, 2021Updated 4 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- ☆11Apr 17, 2024Updated last year
- A Solara web app template for MapLibre☆16Feb 23, 2026Updated 2 weeks ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆46Feb 4, 2026Updated last month
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago
- SDK for Conduit connectors written in Go☆12Updated this week
- Helper for handling PySpark DataFrame partition size 📑🎛️☆12Mar 8, 2024Updated 2 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Netty handler for receiving files over FTP☆14Oct 24, 2013Updated 12 years ago
- Java code for Apache Nifi processors☆11Jun 5, 2017Updated 8 years ago
- Python library allowing to manipulate data split into a collection of groups stored in Zarr format.☆13Jul 11, 2025Updated 7 months ago
- A Mermaid widget for interactively exploring Mermaid diagrams in notebooks and Panel data apps☆12Oct 25, 2024Updated last year
- Advanced parsing of structured data using Python's new match statement☆13Jan 15, 2025Updated last year
- Field Boundaries for Agriculture (fiboa) - a specification that describes important properties of field boundaries☆15Aug 27, 2025Updated 6 months ago
- The official Rock the JVM Akka Persistence Starter project☆11Apr 4, 2019Updated 6 years ago
- Using rio-tiler-mvt to create Mapbox satellite + Elevation 3d Vector tiles.☆13Jun 24, 2019Updated 6 years ago
- Using WASM to write UDFs in Apache Spark☆12Jun 3, 2024Updated last year
- OGC 2D Tile Matrix Set & TileSet Metadata standard☆12Jul 31, 2025Updated 7 months ago
- Flowchart for debugging Spark applications☆106Sep 25, 2024Updated last year
- Data Exploration Using Spark 2.0☆14Apr 17, 2018Updated 7 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago