opensearch-project / opensearch-spark
Spark Accelerator framework ; It enables secondary indices to remote data stores.
☆35Updated last week
Alternatives and similar repositories for opensearch-spark
Users that are interested in opensearch-spark are comparing it to the libraries listed below
Sorting:
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch☆120Updated last week
- Query your data using familiar SQL or intuitive Piped Processing Language (PPL)☆140Updated this week
- Search Request Processor: pipeline for transformation of queries and results inline with a search request.☆24Updated 3 months ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Updated last year
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆63Updated last week
- ☆35Updated this week
- Apache Iceberg Documentation Site☆42Updated last year
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆115Updated 2 months ago
- ☆24Updated last year
- Trino Connector for Apache Paimon.☆33Updated 3 weeks ago
- Apache flink☆23Updated 6 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆106Updated last month
- ☆80Updated 3 weeks ago
- Storage connector for Trino☆110Updated last week
- Offers a library of utilities for building Java-based OpenSearch plugins☆22Updated last week
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆220Updated last month
- ☆84Updated this week
- Spline agent for Apache Spark☆190Updated this week
- Identify atypical data and receive automatic notifications☆72Updated last week
- Official workloads used by OpenSearch Benchmark (OSB)☆24Updated this week
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆39Updated this week
- Performance optimization for Spark running on Kubernetes☆88Updated 4 years ago
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆24Updated this week
- Apache flink☆68Updated last month
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 4 months ago
- Apache datasketches☆95Updated 2 years ago
- Apache Pinot Documentation☆25Updated this week
- ☆24Updated last year
- The Performance Analyzer RCA is a framework that builds on the Performance Analyzer engine to support root cause analysis (RCA) of perfor…☆32Updated 2 weeks ago