Base Docker image with just essentials: Hadoop, Hive and Spark.
☆69Feb 3, 2021Updated 5 years ago
Alternatives and similar repositories for hadoop-hive-spark-docker
Users that are interested in hadoop-hive-spark-docker are comparing it to the libraries listed below
Sorting:
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆172Feb 4, 2021Updated 5 years ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 5 years ago
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆51Dec 7, 2020Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆11Apr 30, 2022Updated 3 years ago
- A Kubernetes CD using CronJobs and kubecfg☆16Nov 10, 2017Updated 8 years ago
- A consumer of a Kafka topic based on Flink☆12Oct 5, 2022Updated 3 years ago
- Reading rosbag files in pure Rust☆14May 27, 2024Updated last year
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Jul 20, 2021Updated 4 years ago
- Explore the use of different patterns to produce clean code☆21Oct 11, 2014Updated 11 years ago
- Docker file for Hadoop 3☆19Apr 21, 2018Updated 7 years ago
- The Proxima platform.☆22Jan 23, 2026Updated last month
- Code for docker images☆39Apr 12, 2023Updated 2 years ago
- Apache Hive☆13Jan 3, 2021Updated 5 years ago
- Quickly set up a POC environment for Kafka+Spark☆15Oct 10, 2017Updated 8 years ago
- Docker image for Apache Hive Metastore☆73Apr 18, 2023Updated 2 years ago
- Playground: Kotlin, Spring Boot, REST JAX-RS, Sprind Data JPA, Spring Data REST, Apache Cassandra, Tests with Spock, Gradle Kotlin Script☆19Jan 14, 2017Updated 9 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆699Oct 1, 2020Updated 5 years ago
- scrapper for various science databases☆11Sep 14, 2023Updated 2 years ago
- Scala MongoDB query builder☆25Jul 1, 2019Updated 6 years ago
- The repository for the new Rock the JVM blog☆20Nov 14, 2024Updated last year
- 源码主要用于学习:1. Spring Boot+Hadoop+Hive+Hbase实现数据基本操作,Hive数据源使用Alibaba DruidDataSource,以及JDBCTemplate操作数据, Hbase使用hbase-client实现数据操作, API可视化界…☆22Jul 27, 2021Updated 4 years ago
- ☆146Apr 21, 2022Updated 3 years ago
- A Python PySpark Projet with Poetry☆27Feb 17, 2026Updated 2 weeks ago
- Smart pet feeder using object detection and ESP32-cam☆22May 19, 2024Updated last year
- hadoop-spark-hive-cluster-docker☆52Nov 3, 2017Updated 8 years ago
- Docker image for Apache Hive running on Tez☆25Apr 24, 2015Updated 10 years ago
- Apache Spark docker image☆2,059Apr 21, 2023Updated 2 years ago
- A sample project shows how to run Spark Streaming app with Kafka in Docker☆36Oct 25, 2017Updated 8 years ago
- Scala Exercises' lessons for the Shapeless library☆23Mar 31, 2023Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆509Nov 7, 2025Updated 4 months ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Sep 20, 2019Updated 6 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Oct 29, 2015Updated 10 years ago
- Apache Hadoop docker image☆2,313Feb 1, 2024Updated 2 years ago
- Automatically create GitHub repositories using YAML templates.☆12Jul 10, 2022Updated 3 years ago
- A big data web application to predict USA airline traffic delay with Python, Flask, Apache Spark, Kafka, MongoDB, ElasticSearch, d3.js, s…☆30Jul 25, 2022Updated 3 years ago
- ☆20Jul 24, 2019Updated 6 years ago
- Straws是一款开源的离线数据同步中间件(ETL),提供Mysql、SqlServer等离线同步场景,同时支持定时同步(全量、增量、CDC三种模式)和数据转换清洗等功能☆11Jul 31, 2022Updated 3 years ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.☆11Mar 19, 2024Updated last year
- ☆1,081Jun 2, 2024Updated last year