Spark and Hive docker containers sharing a common MySQL metastore
☆26Apr 17, 2020Updated 6 years ago
Alternatives and similar repositories for docker-spark-hive-metastore
Users that are interested in docker-spark-hive-metastore are comparing it to the libraries listed below.
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆68Feb 3, 2021Updated 5 years ago
- Cloud based Data Platform based on Apache Spark☆28May 21, 2026Updated last week
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- Files for the Docker and Kubernetes on Google Cloud Hands-On labs☆11Mar 14, 2023Updated 3 years ago
- ☆11Apr 27, 2020Updated 6 years ago
- Jenkins configuration as code docker image☆10Nov 10, 2021Updated 4 years ago
- Over-documented #scala code sample for beginners☆21Aug 29, 2018Updated 7 years ago
- ☆16Jun 27, 2020Updated 5 years ago
- KnetBuilder data integration platform for building knowledge graphs. Previously known as ondex.☆15Apr 2, 2026Updated last month
- Tooling to build a custom Confluent Platform Kafka Connect container with additional connectors from Confluent Hub.☆15Oct 26, 2020Updated 5 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Oct 30, 2025Updated 6 months ago
- Playbook to provision a Confluent Cluster☆10Oct 22, 2017Updated 8 years ago
- Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR data into a SQL Database for analysis☆23May 15, 2026Updated 2 weeks ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- Easily deploy airflow infrastructure on an AWS VPC using terraform.☆11Apr 9, 2019Updated 7 years ago
- A Firebase Cloud Function and a Firebase hosted web app to treat weather data collected by Cloud IoT Core☆18Mar 10, 2019Updated 7 years ago
- trino monitoring with JMX metrics through Prometheus and Grafana☆17Aug 14, 2024Updated last year
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- hackintosh 13900 macos☆12Jun 23, 2023Updated 2 years ago
- An example Terraform repo that utilizes the upstream EKS blueprints project from AWS Integration and Automation.☆14May 11, 2022Updated 4 years ago
- Delta-Lake, ETL, Spark, Airflow☆49Oct 9, 2022Updated 3 years ago
- Automated basic infrastructure to install OKD4 on free ESXi☆13Aug 8, 2020Updated 5 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- Short Range Ultrasonic Radar - A simple radar using the ultrasonic sensor, this radar works by measuring a range from 3cm to 40 cm as non…☆19Nov 11, 2024Updated last year
- Spark-based pipeline to extract and parse monthly games from the Lichess database.☆21Sep 22, 2025Updated 8 months ago
- Supporting repository composed of examples using the mass-ts library. MASS (Mueen's Algorithm for Similarity Search)☆15Aug 20, 2019Updated 6 years ago
- Spark on Docker Swarm example code☆11Nov 27, 2016Updated 9 years ago
- Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink☆28Oct 9, 2023Updated 2 years ago
- docker-hadoop-spark-hive: quickly build your big data environment☆21Jan 4, 2020Updated 6 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 3 years ago
- Basic Spark utilities☆13Feb 20, 2025Updated last year
- Example of orchestrating dependent Databricks jobs using Airflow☆11Dec 19, 2019Updated 6 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- GitHub "AI-Brain-of-Brains" created from (11,400+) hand picked GitHub Repos, Providing advanced search capability for Repos with specific…☆24Oct 4, 2018Updated 7 years ago
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- API REST boilerplate using Spring Boot and Redis as database☆13Dec 26, 2018Updated 7 years ago
- A set of tools to roll out your own hadoop distro.☆15Apr 21, 2018Updated 8 years ago
- A process manager written in C++ and Rust.☆14Oct 26, 2022Updated 3 years ago