opensearch-project / opensearch-sparkLinks
Spark Accelerator framework ; It enables secondary indices to remote data stores.
☆39Updated last week
Alternatives and similar repositories for opensearch-spark
Users that are interested in opensearch-spark are comparing it to the libraries listed below
Sorting:
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch☆136Updated this week
- Query your data using familiar SQL or intuitive Piped Processing Language (PPL)☆157Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆145Updated 5 months ago
- ☆44Updated this week
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆65Updated 2 months ago
- Search Request Processor: pipeline for transformation of queries and results inline with a search request.☆26Updated 3 months ago
- ☆25Updated last year
- Apache datasketches☆103Updated last month
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆136Updated 2 years ago
- Apache flink☆24Updated 3 weeks ago
- 🗃 Automate periodic data operations, such as deleting indices at a certain age or performing a rollover at a certain size☆72Updated this week
- ☆81Updated 8 months ago
- ☆32Updated last month
- ☆95Updated this week
- Multi-hop declarative data pipelines☆122Updated last week
- ☆240Updated this week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆265Updated this week
- Identify atypical data and receive automatic notifications☆85Updated 2 weeks ago
- Trino connectors for accessing APIs with an OpenAPI spec☆41Updated this week
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆65Updated this week
- Apache DataLab (incubating)☆152Updated 2 years ago
- Spline agent for Apache Spark☆201Updated last month
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆45Updated this week
- Idempotent query executor☆53Updated 8 months ago
- ☆70Updated last year
- Apache Iceberg Documentation Site☆42Updated last year
- Apache flink☆75Updated last week
- Presto Trino with Apache Hive Postgres metastore☆43Updated last year
- The Performance Analyzer RCA is a framework that builds on the Performance Analyzer engine to support root cause analysis (RCA) of perfor…☆33Updated 2 months ago
- Neural search transforms text into vectors and facilitates vector search both at ingestion time and at search time.☆105Updated this week