myamafuj / hadoop-hive-spark-dockerView external linksLinks
Hadoop-Hive-Spark cluster + Jupyter on Docker
☆83Jan 2, 2025Updated last year
Alternatives and similar repositories for hadoop-hive-spark-docker
Users that are interested in hadoop-hive-spark-docker are comparing it to the libraries listed below
Sorting:
- Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink☆29Oct 9, 2023Updated 2 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- ☆146Apr 21, 2022Updated 3 years ago
- Scripts for installing Hadoop, HBase, Hive, Pig & Spark.☆10Nov 13, 2019Updated 6 years ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆13May 2, 2021Updated 4 years ago
- Multi-container environment with Hadoop, Spark and Hive☆232May 5, 2025Updated 9 months ago
- ☆15Feb 17, 2020Updated 6 years ago
- Spark + Jupyer + Hive☆16Sep 22, 2015Updated 10 years ago
- Run Hadoop Cluster within Docker Containers.☆16Mar 6, 2025Updated 11 months ago
- A docker image with a pre-configured Hive Metastore and a Spark ThriftServer☆19Jan 20, 2020Updated 6 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 3 years ago
- This is a recipe for docker container based architecture based on airflow, kafka,spark,docker☆20Oct 15, 2024Updated last year
- 源码主要用于学习:1. Spring Boot+Hadoop+Hive+Hbase实现数据基本操作,Hive数据源使用Alibaba DruidDataSource,以及JDBCTemplate操作数据, Hbase使用hbase-client实现数据操作, API可视化界…☆22Jul 27, 2021Updated 4 years ago
- A Hadoop cluster based on Docker, including Hive and Spark.☆83Nov 13, 2022Updated 3 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆507Nov 7, 2025Updated 3 months ago
- Extract transform load CLI tool for extracting small and middle data volume from sources (databases, csv files, xls files, gspreadsheets)…☆11Dec 17, 2025Updated 2 months ago
- Netty教程 - Netty是一个java开源框架。Netty提供异步的、事件驱动的网络应用程序框架和工具,用以快速开发高性能、高可靠性的网络服务器和客户端程序。☆28Mar 14, 2017Updated 8 years ago
- A sample project shows how to run Spark Streaming app with Kafka in Docker☆33Oct 25, 2017Updated 8 years ago
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.☆32Apr 25, 2023Updated 2 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Jul 20, 2021Updated 4 years ago
- Straws是一款开源的离线数据同步中间件(ETL),提供Mysql、SqlServer等离线同步场景,同时支持定时同步(全量、增量、CDC三种模式)和数据转换清洗等功能☆11Jul 31, 2022Updated 3 years ago
- ☆19Jul 24, 2019Updated 6 years ago
- 基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark☆307May 26, 2019Updated 6 years ago
- 一键搭建zookeeper/hadoop/hive/hbase/sqoop/kafka/spark/kylin☆33Jan 2, 2020Updated 6 years ago
- 悟空客户管理:客户关系管理是指企业为提高核心竞争力,利用相应的信息技术以及互联网技术协调企业与顾客间在销售、营销和服务上的交互,从而提升其管理方式,向客户提供创新式的个性化的客户交互和服务的过程。其最终目标是吸引新客户、保留老客户以及将已有客户转为忠实客户,增加市场。☆10Jun 21, 2022Updated 3 years ago
- Datasets and models included in the book "Introduction to Bayesian Data Analysis for Cognitive Science".☆16Dec 11, 2025Updated 2 months ago
- Kafka library with a schema registry integration☆10Dec 16, 2025Updated 2 months ago
- Big Data Inventory Management on AWS (Demand Forecasting, Machine Learning, Dashboarding) : Presented at Carlson School of Management dur…☆11Apr 15, 2020Updated 5 years ago
- Docker powered container for using Nginx as reverse-proxy in combination with an OpenVPN Client.☆11Jan 1, 2020Updated 6 years ago
- Example of authentication via Auth0 for react-admin☆11Sep 20, 2020Updated 5 years ago
- Um template com várias configurações para aspnet-core☆15Mar 4, 2023Updated 2 years ago
- hadoop中Map/Reduce使用示例,输入(DBInputFormat),输出(DBOutputFormat)为MySql数据库表、日志分析Grep、单词排序Sort...对HBase的基本操作,增、删、查、改,使用Map/Reduce批量导入数据到HBase表中..…☆14Apr 6, 2013Updated 12 years ago
- Hadoop/Hive/Spark container to perform CI tests☆10Dec 26, 2020Updated 5 years ago
- Content related to style guides and conventions around API Design☆11Jul 22, 2020Updated 5 years ago
- CRUD with Authentication and Authorization using Get x cli pattern and Supabase☆11Nov 5, 2023Updated 2 years ago
- Depenency free (so far) Vanilla JS Dashboard UI for the mediamtx streaming server. Dockerized.☆28Feb 2, 2026Updated 2 weeks ago
- Kubenetes with SpringBoot demo☆10Feb 20, 2019Updated 6 years ago
- Apache Spark docker image☆2,058Apr 21, 2023Updated 2 years ago