opensearch-project / opensearch-hadoop
☆31Updated this week
Alternatives and similar repositories for opensearch-hadoop:
Users that are interested in opensearch-hadoop are comparing it to the libraries listed below
- Apache flink☆60Updated last month
- AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry…☆133Updated 2 months ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆34Updated last year
- Performance optimization for Spark running on Kubernetes☆85Updated 4 years ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆62Updated last week
- Offers a library of utilities for building Java-based OpenSearch plugins☆20Updated 3 weeks ago
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO)☆28Updated last month
- Search Request Processor: pipeline for transformation of queries and results inline with a search request.☆23Updated last week
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch☆115Updated this week
- ☆24Updated 4 months ago
- Amazon EMR on EKS Custom Image CLI☆26Updated 3 months ago
- This is a fork of the Apache Flink Kinesis connector adding Enhanced Fanout support for Flink 1.8/1.11 on KDA.☆22Updated last year
- ☆23Updated 10 months ago
- Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.☆141Updated 7 months ago
- ☆13Updated 2 months ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆210Updated 8 months ago
- Performance Testing Framework for Apache Kafka☆46Updated last year
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 9 months ago
- ☆79Updated last year
- Aiven's OpenSearch® Connector for Apache Kafka®☆69Updated 2 weeks ago
- Plugin that adds dense neural retrieval into the OpenSearch ecosytem☆69Updated this week
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆58Updated last year
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated last year
- Redshift JDBC Driver. It supports JDBC 4.2 specification.☆64Updated 3 weeks ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆70Updated this week
- Amazon Managed Service for Apache Flink Benchmarking Utility helps with capacity planning, integration testing, and benchmarking of Amazo…☆20Updated last year
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆82Updated last week
- ml-commons provides a set of common machine learning algorithms, e.g. k-means, or linear regression, to help developers build ML related …☆103Updated this week