opensearch-project / opensearch-spark
Spark Accelerator framework; it enables secondary indices for remote data stores.
☆38 Updated this week
Alternatives and similar repositories for opensearch-spark
Users that are interested in opensearch-spark are comparing it to the libraries listed below
- Query your data using familiar SQL or intuitive Piped Processing Language (PPL) ☆153 Updated this week
- ☆39 Updated last week
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch ☆132 Updated last week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark… ☆141 Updated 3 months ago
- Search Request Processor: pipeline for transformation of queries and results inline with a search request. ☆26 Updated last month
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3. ☆57 Updated this week
- 🗃 Automate periodic data operations, such as deleting indices at a certain age or performing a rollover at a certain size ☆68 Updated this week
- Distributed SQL query engine for big data ☆51 Updated last week
- Neural search transforms text into vectors and facilitates vector search both at ingestion time and at search time. ☆101 Updated last week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆136 Updated 2 years ago
- ☆25 Updated last year
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary! ☆233 Updated 9 months ago
- Best practices and recommendations for getting started with Amazon EMR on EKS. ☆67 Updated 4 months ago
- ml-commons provides a set of common machine learning algorithms, e.g. k-means, or linear regression, to help developers build ML related … ☆131 Updated this week
- Spline agent for Apache Spark ☆199 Updated last week
- Apache Flink ☆22 Updated 3 months ago
- ☆233 Updated 2 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆292 Updated this week
- Storage connector for Trino ☆116 Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆102 Updated 2 years ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like… ☆56 Updated 2 years ago
- ☆26 Updated 4 months ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a… ☆226 Updated 7 months ago
- Identify atypical data and receive automatic notifications ☆84 Updated this week
- Migrate, upgrade, reconfigure, and replicate OpenSearch clusters with ease. ☆62 Updated this week
- ☆70 Updated 10 months ago
- Apache Flink ☆73 Updated 4 months ago
- Amundsen Gremlin ☆21 Updated 3 years ago
- Multi-hop declarative data pipelines ☆122 Updated last week
- Performance optimization for Spark running on Kubernetes ☆89 Updated 5 years ago