Spark and Hive docker containers sharing a common MySQL metastore
☆26Apr 17, 2020Updated 6 years ago
Alternatives and similar repositories for docker-spark-hive-metastore
Users that are interested in docker-spark-hive-metastore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- A memory visualisation simulator written in JavaFX☆53Oct 13, 2020Updated 5 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Jan 31, 2023Updated 3 years ago
- Docker build project to setup a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager☆23Jun 17, 2017Updated 8 years ago
- Spark + Jupyer + Hive☆16Sep 22, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- Jenkins configuration as code docker image☆10Nov 10, 2021Updated 4 years ago
- A docker image with a pre-configured Hive Metastore and a Spark ThriftServer☆19Jan 20, 2020Updated 6 years ago
- over-documented #scala code sample for beginner☆21Aug 29, 2018Updated 7 years ago
- deep learning related articles☆11May 27, 2021Updated 4 years ago
- Tooling to build a custom Confluent Platform Kafka Connect container with additional connectors from Confluent Hub.☆15Oct 26, 2020Updated 5 years ago
- SQL on HBase with Apache Phoenix in Docker☆29Mar 21, 2016Updated 10 years ago
- Collection of Interesting Algorithms☆16Oct 13, 2020Updated 5 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Oct 30, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- reflectoring blog☆42Aug 6, 2024Updated last year
- Playbook to provision a Confluent Cluster☆10Oct 22, 2017Updated 8 years ago
- ☆11Oct 11, 2022Updated 3 years ago
- A fully functional and secure EKS cluster provisioned with Terraform and powered by ArgoCD☆12Jun 14, 2023Updated 2 years ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- Easily deploy airflow infrastructure on an AWS VPC using terraform.☆11Apr 9, 2019Updated 7 years ago
- Java IDE Pack for VS Code - All Awesome extentions☆12Oct 12, 2018Updated 7 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- A Docker container with a full Hadoop cluster setup with Spark and Zeppelin☆67Feb 2, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Delta-Lake, ETL, Spark, Airflow☆49Oct 9, 2022Updated 3 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink☆29Oct 9, 2023Updated 2 years ago
- Spark on Docker Swarm example code☆11Nov 27, 2016Updated 9 years ago
- docker-hadoop-spark-hive 快速构建你的大数据环境☆21Jan 4, 2020Updated 6 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 3 years ago
- Basic Spark utilities☆13Feb 20, 2025Updated last year
- Example of orchestrating dependent Databricks jobs using Airflow☆11Dec 19, 2019Updated 6 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 6 months ago
- API REST boilerplate using Spring Boot and Redis as database☆13Dec 26, 2018Updated 7 years ago
- Terraform scripts for deploying Apiary Data Lake☆19Mar 16, 2026Updated last month
- Example to create lineage in Atlas with sqoop and spark☆14Apr 5, 2017Updated 9 years ago
- This repository contains icebreaker examples for migen.☆12Jan 4, 2019Updated 7 years ago
- The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…☆16Jan 3, 2022Updated 4 years ago