Spark and Hive docker containers sharing a common MySQL metastore
☆26Apr 17, 2020Updated 6 years ago
Alternatives and similar repositories for docker-spark-hive-metastore
Users that are interested in docker-spark-hive-metastore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆67Feb 3, 2021Updated 5 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Jan 31, 2023Updated 3 years ago
- Cloud based Data Platform based on Apache Spark☆28May 21, 2026Updated 3 weeks ago
- Ingestion of bid requests through Amazon Kinesis Firehose and Kinesis Data Analytics. Data lake storage with Amazon S3. Restitution with …☆26Dec 10, 2022Updated 3 years ago
- Docker build project to setup a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager☆23Jun 17, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Spark + Jupyer + Hive☆16Sep 22, 2015Updated 10 years ago
- Jenkins configuration as code docker image☆10Nov 10, 2021Updated 4 years ago
- A docker image with a pre-configured Hive Metastore and a Spark ThriftServer☆19Jan 20, 2020Updated 6 years ago
- over-documented #scala code sample for beginner☆21Aug 29, 2018Updated 7 years ago
- Companion repository for the "WebSockets and AsyncIO: Beyond 5-line Samples" blog post☆13Mar 27, 2022Updated 4 years ago
- Tooling to build a custom Confluent Platform Kafka Connect container with additional connectors from Confluent Hub.☆15Oct 26, 2020Updated 5 years ago
- SQL on HBase with Apache Phoenix in Docker☆29Mar 21, 2016Updated 10 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Oct 30, 2025Updated 7 months ago
- Playbook to provision a Confluent Cluster☆10Oct 22, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Oct 11, 2022Updated 3 years ago
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated 3 months ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- An example Terraform repo that utilizes the upstream EKS blueprints project from AWS Integration and Automation.☆14May 11, 2022Updated 4 years ago
- Delta-Lake, ETL, Spark, Airflow☆49Oct 9, 2022Updated 3 years ago
- 通过supervisor启动kubernetes各个组件☆11Jan 6, 2016Updated 10 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- Spark-based pipeline to extract and parse monthly games from the Lichess database.☆22Sep 22, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Objects and Animals detection with Wifi camera and Yolo☆19Apr 28, 2024Updated 2 years ago
- Reading rosbag files in pure Rust☆14May 27, 2024Updated 2 years ago
- Example of orchestrating dependent Databricks jobs using Airflow☆11Dec 19, 2019Updated 6 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- GitHub "AI-Brain-of-Brains" created from (11,400+) hand picked GitHub Repos, Providing advanced search capability for Repos with specific…☆25Oct 4, 2018Updated 7 years ago
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16May 21, 2026Updated 3 weeks ago
- API REST boilerplate using Spring Boot and Redis as database☆13Dec 26, 2018Updated 7 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an…☆12Dec 29, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- A Kafka metric sink for Apache Spark☆11Apr 13, 2017Updated 9 years ago
- Example to create lineage in Atlas with sqoop and spark☆14Apr 5, 2017Updated 9 years ago
- This repository contains icebreaker examples for migen.☆12May 24, 2026Updated 3 weeks ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Jul 11, 2018Updated 7 years ago
- The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…☆16Jan 3, 2022Updated 4 years ago
- Reinforcement Learning Algorithms☆14May 28, 2018Updated 8 years ago