amesar/docker-spark-hive-metastore

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amesar/docker-spark-hive-metastore)

amesar / docker-spark-hive-metastore

Spark and Hive docker containers sharing a common MySQL metastore

☆26

Alternatives and similar repositories for docker-spark-hive-metastore

Users that are interested in docker-spark-hive-metastore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

panovvv / hadoop-hive-spark-docker
View on GitHub
Base Docker image with just essentials: Hadoop, Hive and Spark.
☆67Feb 3, 2021Updated 5 years ago
arempter / hive-metastore-docker
View on GitHub
Example for article Running Spark 3 with standalone Hive Metastore 3.0
☆100Jan 31, 2023Updated 3 years ago
homeaway / datapull
View on GitHub
Cloud based Data Platform based on Apache Spark
☆28Jun 30, 2026Updated 3 weeks ago
hervenivon / aws-experiments-data-ingestion-and-analytics
View on GitHub
Ingestion of bid requests through Amazon Kinesis Firehose and Kinesis Data Analytics. Data lake storage with Amazon S3. Restitution with …
☆26Dec 10, 2022Updated 3 years ago
anair-it / hadoop-docker-lite
View on GitHub
Docker build project to setup a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager
☆23Jun 17, 2017Updated 9 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
google / fhir-dbt-utils
View on GitHub
Utility functions to support analytics over FHIR in BigQuery or Apache Spark
☆15Jan 8, 2024Updated 2 years ago
bryanyang0528 / docker-spark-hive-ipython
View on GitHub
Spark + Jupyer + Hive
☆16Sep 22, 2015Updated 10 years ago
K8sAcademy / GoogleCloud-HandsOn
View on GitHub
Files for the Docker and Kubernetes on Google Cloud Hands-On labs
☆11Mar 14, 2023Updated 3 years ago
japerry911 / crypto-data-pipeline
View on GitHub
Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.
☆10Jan 23, 2023Updated 3 years ago
lewisbarber / public-chat-room
View on GitHub
A public chat room built using Java Spring Framework 4, Web Sockets and STOMP messaging.
☆10Feb 24, 2016Updated 10 years ago
juli1 / scala-cookbook
View on GitHub
over-documented #scala code sample for beginner
☆21Aug 29, 2018Updated 7 years ago
emnify / jenkins-casc-docker
View on GitHub
Jenkins configuration as code docker image
☆10Nov 10, 2021Updated 4 years ago
sebolabs / eks-tf-gitops
View on GitHub
A fully functional and secure EKS cluster provisioned with Terraform and powered by ArgoCD
☆12Jun 14, 2023Updated 3 years ago
davef77 / RefactoringBadCode
View on GitHub
Starting point for an exercise in refactoring bad code
☆14Mar 23, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
sequenceiq / docker-phoenix
View on GitHub
SQL on HBase with Apache Phoenix in Docker
☆29Mar 21, 2016Updated 10 years ago
robcowart / cp-kafka-connect-custom
View on GitHub
Tooling to build a custom Confluent Platform Kafka Connect container with additional connectors from Confluent Hub.
☆15Oct 26, 2020Updated 5 years ago
jetbrains-infra / terraform-aws-spot-fleet
View on GitHub
AWS Spot fleet terraform module
☆11Apr 26, 2019Updated 7 years ago
awslabs / amazon-s3-tagging-spark-util
View on GitHub
☆12Oct 16, 2023Updated 2 years ago
hant121 / shortrangeradar
View on GitHub
Short Range Ultrasonic Radar - A simple radar using the ultrasonic sensor, this radar works by measuring a range from 3cm to 40 cm as non…
☆19Nov 11, 2024Updated last year
Rothamsted / knetbuilder
View on GitHub
KnetBuilder data integration platform for building knowledge graphs. Previously known as ondex.
☆16Apr 2, 2026Updated 3 months ago
dimajix / terraform-emr-training
View on GitHub
Terraform script for launching multiple EMR clusters for training purposes.
☆16Oct 30, 2025Updated 8 months ago
jenciso / confluent-cluster
View on GitHub
Playbook to provision a Confluent Cluster
☆10Oct 22, 2017Updated 8 years ago
newfront / spark-intro-to-ml
View on GitHub
A Gentle introduction to Machine Learning with Apache Spark
☆11Mar 2, 2026Updated 4 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
aws-samples / amazon-emr-optimize-data-processing
View on GitHub
Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark
☆14Apr 14, 2023Updated 3 years ago
joomcode / spark-platform
View on GitHub
Basic Spark utilities
☆13Updated this week
yukiti2007 / sample
View on GitHub
☆11Jun 27, 2023Updated 3 years ago
vincentclaes / glue-devcontainer
View on GitHub
Glue VSCode devcontainer setup
☆14Jan 31, 2023Updated 3 years ago
IgorJanos / WikiToPdf
View on GitHub
Python tool to help export Azure DevOps WIKI into a single PDF
☆10May 10, 2020Updated 6 years ago
Stefen-Taime / stream-ingestion-redpanda-minio
View on GitHub
In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…
☆11Jun 27, 2023Updated 3 years ago
jamartinh / Orange3-Spark
View on GitHub
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15Dec 24, 2016Updated 9 years ago
kiwenlau / kubernetes-supervisor
View on GitHub
通过supervisor启动kubernetes各个组件
☆11Jan 6, 2016Updated 10 years ago
TribalNightOwl / okd4-esxi-infra
View on GitHub
Automated basic infrastructure to intall OKD4 on free ESXi
☆13Aug 8, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Stefen-Taime / modern-data-pipeline
View on GitHub
reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.
☆15Jun 26, 2023Updated 3 years ago
superorbital / aws-eks-blueprint-examples
View on GitHub
An example Terraform repo that utilizes the upstream EKS blueprints project from AWS Integration and Automation.
☆14May 11, 2022Updated 4 years ago
mark-hoffmann / fastteradata
View on GitHub
Tools for faster and optimized interaction with Teradata and large datasets.
☆17Jul 11, 2018Updated 8 years ago
enkhalifapro / bigdata-all-in-one
View on GitHub
Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink
☆28Oct 9, 2023Updated 2 years ago
aocenas / spark-docker-swarm
View on GitHub
Spark on Docker Swarm example code
☆11Nov 27, 2016Updated 9 years ago
ibywind / docker-hadoop-spark-hive
View on GitHub
docker-hadoop-spark-hive 快速构建你的大数据环境
☆21Jan 4, 2020Updated 6 years ago
ExpediaGroup / apiary-data-lake
View on GitHub
Terraform scripts for deploying Apiary Data Lake
☆19Apr 16, 2026Updated 3 months ago