Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations should be performed.
☆10 · Jul 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for lighthouse
Users who are interested in lighthouse are comparing it to the libraries listed below.
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 · May 12, 2023 · Updated 2 years ago
- Delta Acceptance Testing ☆23 · Aug 25, 2025 · Updated 6 months ago
- Lokalise API v2 official Python library ☆15 · Mar 13, 2026 · Updated last week
- ☆59 · Jan 3, 2024 · Updated 2 years ago
- Delta Lake helper methods. No Spark dependency. ☆22 · Jan 19, 2026 · Updated 2 months ago
- Pandas helper functions ☆31 · Feb 19, 2023 · Updated 3 years ago
- Calendar tables for Power BI with TMDL or M code. ☆41 · Jun 10, 2025 · Updated 9 months ago
- Instant search for and access to many datasets in Pyspark. ☆34 · Oct 6, 2022 · Updated 3 years ago
- A Delta Lake reader for Dask ☆53 · Jul 29, 2025 · Updated 7 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆98 · Mar 17, 2025 · Updated last year
- Modeling directed acyclic graphs (DAG) for topological sorting, shortest path, longest path, etc. ☆14 · Sep 1, 2017 · Updated 8 years ago
- An example of SparkConnect extension. ☆15 · Mar 5, 2024 · Updated 2 years ago
- Icons (PNG and SVG) for Power BI ☆36 · Jul 5, 2023 · Updated 2 years ago
- PySpark schema generator ☆44 · Feb 23, 2023 · Updated 3 years ago
- Speak Slack notifications and process Slack slash commands ☆15 · Dec 20, 2018 · Updated 7 years ago
- ☆26 · Feb 22, 2026 · Updated 3 weeks ago
- Write property based tests easily on spark dataframes ☆20 · Jan 19, 2024 · Updated 2 years ago
- [WORK IN PROGRESS] Create STAC Items from vector datasets ☆10 · Dec 11, 2023 · Updated 2 years ago
- Go library for efficient skyline queries ☆18 · Aug 17, 2025 · Updated 7 months ago
- Add geo functionality extension to datafusion query engine. ☆11 · Apr 26, 2024 · Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆29 · Jul 7, 2022 · Updated 3 years ago
- Python library allowing to manipulate data split into a collection of groups stored in Zarr format. ☆13 · Jul 11, 2025 · Updated 8 months ago
- ☆18 · Feb 11, 2023 · Updated 3 years ago
- Automatically convert a stream of tile coordinates to another format ☆12 · Apr 7, 2023 · Updated 2 years ago
- The Modern Data Stack in a (Smaller) Box ☆12 · Jan 28, 2023 · Updated 3 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python. ☆12 · Jan 29, 2025 · Updated last year
- A Rust port of kdbush, a fast static spatial index for 2D points. ☆12 · Sep 27, 2022 · Updated 3 years ago
- A small clone of Planetary Computer items, served with stac-fastapi, browsable with stac-browser, backed by pgstac ☆12 · Jan 10, 2023 · Updated 3 years ago
- Delta lake and filesystem helper methods ☆50 · Feb 29, 2024 · Updated 2 years ago
- mercury-graph is a Python library that offers graph analytics capabilities with a technology-agnostic API. ☆38 · Mar 21, 2025 · Updated 11 months ago
- Exploring modern RESTful services for gridded data ☆12 · May 8, 2022 · Updated 3 years ago
- A VS Code extension to show the latest version of a dependency in pyproject.toml or requirements.txt ☆13 · Sep 2, 2023 · Updated 2 years ago
- ☆15 · Apr 2, 2024 · Updated last year
- ☆12 · Sep 19, 2022 · Updated 3 years ago
- The ecosystem of geospatial machine learning tools in the Pangeo world. ☆12 · Mar 17, 2025 · Updated last year
- Coming soon ☆63 · Nov 9, 2023 · Updated 2 years ago
- Example playbooks for Ansible ☆56 · Nov 6, 2015 · Updated 10 years ago
- Test suite to document the behavior of Spark ☆21 · Apr 15, 2021 · Updated 4 years ago
- Let Pydantic and Shapely work together! ☆18 · Jan 27, 2026 · Updated last month