apache / sedona
A cluster computing framework for processing large-scale geospatial data
☆1,958 · Updated this week
Related projects
Alternatives and complementary repositories for sedona
- GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion. ☆1,433 · Updated this week
- GeoTrellis is a geographic data processing engine for high performance applications. ☆1,342 · Updated this week
- Geo Spatial Data Analytics on Spark ☆532 · Updated 3 years ago
- Specification for storing geospatial vector data (point, line, polygon) in Parquet ☆829 · Updated 3 weeks ago
- GeoWave provides geospatial and temporal indexing on top of Accumulo, HBase, BigTable, Cassandra, Kudu, Redis, RocksDB, and DynamoDB. ☆502 · Updated last year
- The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis… ☆368 · Updated 8 months ago
- Geospatial Raster support for Spark DataFrames ☆246 · Updated 7 months ago
- The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data. ☆521 · Updated 2 years ago
- Template projects for GeoSpark, GeoSpark-SQL, GeoSpark-Viz ☆65 · Updated 3 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆889 · Updated last week
- Specification for storing geospatial data in Apache Arrow ☆427 · Updated this week
- The second generation of SpatialHadoop that ships as an extension ☆153 · Updated last year
- Apache Iceberg ☆6,473 · Updated this week
- GeoTrellis for PySpark ☆179 · Updated 4 years ago
- Tutorials and examples for working with GeoMesa ☆99 · Updated last month
- Apache Parquet Java ☆2,642 · Updated this week
- ☆487 · Updated last week
- Apache Parquet Format ☆1,810 · Updated last week
- Upserts, Deletes And Incremental Processing on Big Data. ☆5,444 · Updated this week
- Essential Spark extensions and helper methods ✨😲 ☆754 · Updated 3 weeks ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,010 · Updated 2 years ago
- Big Spatial Data Processing using Spark ☆145 · Updated 7 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,040 · Updated this week
- Qubole Sparklens tool for performance tuning Apache Spark ☆568 · Updated 4 months ago
- A Spark plugin for reading and writing Excel files ☆468 · Updated 2 weeks ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,212 · Updated this week
- Geospatial extensions for Polars ☆627 · Updated 2 months ago
- Spatial In-Memory Big data Analytics ☆121 · Updated 5 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,105 · Updated this week