godatadriven-dockerhub / hive-metastore
Hadoop/Hive/Spark container to perform CI tests
☆11 Updated 4 years ago
Alternatives and similar repositories for hive-metastore:
Users who are interested in hive-metastore compare it to the repositories listed below.
- Yet Another (Spark) ETL Framework ☆20 Updated last year
- ☆14 Updated last month
- ☆52 Updated 7 months ago
- ☆15 Updated last year
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium ☆27 Updated 4 years ago
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated 5 years ago
- A sample project for KSQL along with debezium and kafka connect ☆15 Updated 2 years ago
- Presto cluster on top of kubernetes ☆9 Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated this week
- Slowly Changing Dimension type 2 in Hive query language, using the exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 Updated 5 years ago
- ☆11 Updated last year
- Examples of Spark 3.0 ☆47 Updated 4 years ago
- Magic to help Spark pipelines upgrade ☆34 Updated 5 months ago
- ☆27 Updated 2 months ago
- minio as local storage and DynamoDB as catalog ☆13 Updated 10 months ago
- Ecosystem website for Apache Flink ☆11 Updated last year
- Presto Trino with Apache Hive Postgres metastore ☆40 Updated 6 months ago
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- Sample processing code using Spark 2.1+ and Scala ☆51 Updated 4 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆19 Updated 4 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake ☆36 Updated 11 months ago
- ☆13 Updated last year
- A Flink application that demonstrates reading and writing to/from Apache Kafka with Apache Flink ☆20 Updated last year
- Full stack data engineering tools and infrastructure set-up ☆50 Updated 4 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi ☆15 Updated 2 years ago
- Recipes of publicly-available Jupyter images ☆8 Updated last week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆10 Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 Updated last year
- This project contains a couple of tools to analyze data around the Apache Flink community. ☆18 Updated 9 months ago
- ☆17 Updated 2 years ago