godatadriven-dockerhub / hive-metastoreLinks
Hadoop/Hive/Spark container to perform CI tests
☆11Updated 4 years ago
Alternatives and similar repositories for hive-metastore
Users that are interested in hive-metastore are comparing it to the libraries listed below
Sorting:
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Set of tools for creating backups, compaction and restoration of Apache Kafka® Clusters☆21Updated this week
- cod-examples☆16Updated 2 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆11Updated 5 years ago
- Receipes of publicly-available Jupyter images☆8Updated 4 months ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated 9 months ago
- ☆14Updated 2 years ago
- Stocks -> NiFi -> Kafka -> Profit☆14Updated 6 years ago
- Presto cluster on top of kubernetes☆9Updated 3 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 2 months ago
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Updated last year
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆70Updated 2 years ago
- Ecosystem website for Apache Flink☆12Updated last year
- CDF Tech Bootcamp☆9Updated 5 years ago
- minio as local storage and DynamoDB as catalog☆15Updated last year
- Ambari View for the Ambari Store☆15Updated 9 years ago
- ☆30Updated last month
- HDF masterclass materials☆28Updated 9 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- HDFS Automatic Snapshot Service for Linux☆12Updated 8 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 6 years ago
- This repository contains a Kafka Connect sink connector for copying data from Apache Kafka into IBM MQ.☆41Updated this week
- Code for the fictitious food delivery company GottaEat used in the Pulsar In Action book☆18Updated 3 years ago
- ☆17Updated 3 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆28Updated last week