apache / sedona
A cluster computing framework for processing large-scale geospatial data
☆1,989 Updated this week
Alternatives and similar repositories for sedona:
Users who are interested in sedona compare it to the libraries listed below.
- GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion. ☆1,441 Updated this week
- Geo Spatial Data Analytics on Spark ☆531 Updated 3 years ago
- GeoTrellis is a geographic data processing engine for high performance applications. ☆1,356 Updated 3 weeks ago
- Specification for storing geospatial vector data (point, line, polygon) in Parquet ☆875 Updated 2 weeks ago
- The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis… ☆369 Updated 2 weeks ago
- GeoWave provides geospatial and temporal indexing on top of Accumulo, HBase, BigTable, Cassandra, Kudu, Redis, RocksDB, and DynamoDB. ☆505 Updated last year
- Template projects for GeoSpark, GeoSpark-SQL, GeoSpark-Viz ☆66 Updated 4 years ago
- Geospatial Raster support for Spark DataFrames ☆249 Updated 9 months ago
- The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data. ☆521 Updated 2 years ago
- The second generation of SpatialHadoop that ships as an extension ☆153 Updated 2 years ago
- An Open Standard for lineage metadata collection ☆1,827 Updated this week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆899 Updated 2 months ago
- Specification for storing geospatial data in Apache Arrow ☆450 Updated 2 weeks ago
- GeoTrellis for PySpark ☆180 Updated 4 years ago
- ☆1,016 Updated last week
- Big Spatial Data Processing using Spark ☆145 Updated 7 years ago
- Apache Iceberg ☆6,832 Updated this week
- A curated list of awesome Apache Spark packages and resources. ☆1,747 Updated 3 months ago
- Python bindings for H3, a hierarchical hexagonal geospatial indexing system ☆864 Updated this week
- Geospatial extensions for Polars ☆657 Updated 5 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,131 Updated this week
- The Internals of Apache Spark ☆1,489 Updated 4 months ago
- Build your own Raster dynamic map tile services ☆823 Updated this week
- Tutorials and examples for working with GeoMesa ☆100 Updated last week
- Jupyter notebooks for h3-py, a hierarchical hexagonal geospatial indexing system ☆270 Updated 2 years ago
- A performant binary encoding for geographic data based on flatbuffers ☆703 Updated 2 weeks ago
- Apache Parquet Java ☆2,698 Updated this week
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. ☆3,354 Updated last week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆818 Updated 2 months ago
- Essential Spark extensions and helper methods ✨😲 ☆754 Updated 3 months ago