opensearch-project / opensearch-sparkLinks
Spark Accelerator framework ; It enables secondary indices to remote data stores.
☆37Updated this week
Alternatives and similar repositories for opensearch-spark
Users that are interested in opensearch-spark are comparing it to the libraries listed below
Sorting:
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch☆125Updated this week
- Query your data using familiar SQL or intuitive Piped Processing Language (PPL)☆147Updated last week
- Identify atypical data and receive automatic notifications☆76Updated last week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆137Updated last week
- ☆37Updated last week
- ☆40Updated 2 years ago
- Apache datasketches☆98Updated 2 years ago
- Multi-hop declarative data pipelines☆118Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- ☆25Updated last year
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆235Updated 3 weeks ago
- Point-in-Time optimizations for Apache Spark☆30Updated last year
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆52Updated last week
- Spline agent for Apache Spark☆196Updated this week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆282Updated last week
- 🆕 Find the k-nearest neighbors (k-NN) for your vector data☆193Updated last week
- Apache Calcite Adapter for Apache Kudu☆28Updated 10 months ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated last year
- Apache Hive Metastore as a Standalone server in Docker☆79Updated last year
- Apache iceberg Spark s3 examples☆20Updated last year
- Neural search transforms text into vectors and facilitates vector search both at ingestion time and at search time.☆93Updated last week
- Lakehouse storage system benchmark☆76Updated 2 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆224Updated 5 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- A BYOC option for Snowflake workloads☆88Updated this week
- ☆220Updated this week
- Search Request Processor: pipeline for transformation of queries and results inline with a search request.☆26Updated 7 months ago
- Apache flink☆22Updated last month
- An implementation of the Random Cut Forest data structure for sketching streaming data, with support for anomaly detection, density estim…☆228Updated 3 months ago
- ml-commons provides a set of common machine learning algorithms, e.g. k-means, or linear regression, to help developers build ML related …☆127Updated last week