The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an extern…
☆229May 18, 2026Updated last week
Alternatives and similar repositories for aws-glue-data-catalog-client-for-apache-hive-metastore
Users that are interested in aws-glue-data-catalog-client-for-apache-hive-metastore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- Apache Spark build compatible with AWS Glue Data Catalog.☆19Aug 9, 2021Updated 4 years ago
- Docker image for running Spark 3 on Kubernetes on AWS☆26May 26, 2021Updated 5 years ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations.☆701Apr 24, 2026Updated last month
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…☆201Jun 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AWS Glue code samples☆1,532Nov 5, 2025Updated 6 months ago
- ☆12Aug 9, 2024Updated last year
- Spline agent for Apache Spark☆202May 21, 2026Updated last week
- ☆14Feb 26, 2024Updated 2 years ago
- Docker image that builds a patched Apache Spark with AWS Glue support as metastore☆18Jun 8, 2024Updated last year
- Amazon EMR on EKS Custom Image CLI☆32Sep 26, 2024Updated last year
- Building an ETL process using Spark EMR in AWS☆10Jun 27, 2019Updated 6 years ago
- A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service☆13Mar 26, 2026Updated 2 months ago
- 🌉 Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena☆30Jul 25, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Data Lineage Tracking And Visualization Solution☆658May 20, 2026Updated last week
- Iceberg is a table format for large, slow-moving tabular data☆493Apr 10, 2023Updated 3 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,618May 22, 2026Updated last week
- ☆11Oct 11, 2022Updated 3 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆286Feb 24, 2026Updated 3 months ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16May 21, 2026Updated last week
- Example code for running Spark and Hive jobs on EMR Serverless.☆170May 14, 2026Updated 2 weeks ago
- ☆23May 2, 2024Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,820Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Spark Atlas connector to track data lineage in Apache Atlas☆268Nov 16, 2022Updated 3 years ago
- pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoD…☆4,112May 19, 2026Updated last week
- Jupyter magics and kernels for working with remote Spark clusters☆1,361Sep 9, 2025Updated 8 months ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆306Oct 30, 2025Updated 6 months ago
- ☆71May 8, 2026Updated 3 weeks ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆39Feb 17, 2025Updated last year
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Helper library to run AWS Glue ETL scripts docker container for local testing of development in a Jupyter notebook☆20Feb 13, 2024Updated 2 years ago
- ☆13Feb 19, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB☆228Apr 8, 2026Updated last month
- Glue scripts for converting AWS Service Logs for use in Athena☆139Feb 1, 2024Updated 2 years ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆317Updated this week
- This is a fork of the Apache Flink Kinesis connector adding Enhanced Fanout support for Flink 1.8/1.11 on KDA.☆24Mar 1, 2026Updated 2 months ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆91Dec 29, 2022Updated 3 years ago
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.☆610May 20, 2026Updated last week
- A load balancer / proxy / gateway for prestodb☆358Jul 25, 2024Updated last year