opensearch-project / opensearch-hadoop
☆35Updated this week
Alternatives and similar repositories for opensearch-hadoop
Users that are interested in opensearch-hadoop are comparing it to the libraries listed below
Sorting:
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch☆120Updated last week
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆106Updated last month
- Redshift JDBC Driver. It supports JDBC 4.2 specification.☆64Updated 4 months ago
- AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry…☆136Updated 3 months ago
- Apache flink☆68Updated last month
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆115Updated 2 months ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆220Updated last month
- Search Request Processor: pipeline for transformation of queries and results inline with a search request.☆24Updated 3 months ago
- Spark Accelerator framework ; It enables secondary indices to remote data stores.☆35Updated last week
- Performance optimization for Spark running on Kubernetes☆88Updated 4 years ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆63Updated last week
- Amazon EMR on EKS Custom Image CLI☆31Updated 7 months ago
- ☆80Updated 3 weeks ago
- A testing framework for Trino☆26Updated last month
- Collection of code examples for Amazon Managed Service for Apache Flink☆56Updated this week
- Example code for running Spark and Hive jobs on EMR Serverless.☆164Updated 4 months ago
- ☆24Updated last year
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆60Updated 2 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Updated last year
- Helm charts for Trino and Trino Gateway☆165Updated last week
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 2 years ago
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆39Updated this week
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- Spark runtime on AWS Lambda☆107Updated 7 months ago
- Neural search transforms text into vectors and facilitates vector search both at ingestion time and at search time.☆82Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆77Updated last month
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆183Updated 3 weeks ago
- ☆20Updated 7 months ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆85Updated 2 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆64Updated last year