big-data-europe / docker-hadoop
Apache Hadoop docker image
☆2,253Updated last year
Alternatives and similar repositories for docker-hadoop:
Users that are interested in docker-hadoop are comparing it to the libraries listed below
- ☆1,050Updated 10 months ago
- Apache Spark docker image☆2,054Updated 2 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆692Updated 4 years ago
- ☆250Updated 2 years ago
- Run Hadoop Custer within Docker Containers☆1,813Updated 9 months ago
- Multi-container environment with Hadoop, Spark and Hive☆211Updated last year
- Hadoop docker image☆1,211Updated 4 years ago
- Apache Flink docker image☆193Updated 2 years ago
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.☆2,211Updated this week
- A connector for Spark that allows reading and writing to/from Redis cluster☆947Updated 6 months ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆164Updated 4 years ago
- Docker build for Apache Spark☆673Updated 3 years ago
- Apache Flink Playgrounds☆518Updated 2 months ago
- 基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark☆302Updated 5 years ago
- A simple spark standalone cluster for your testing environment purposses☆571Updated last year
- 50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian,…☆1,340Updated last month
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,476Updated this week
- Azkaban workflow manager.☆4,492Updated 9 months ago
- HBase running in Docker☆331Updated 2 years ago
- Mirror of Apache Bahir Flink☆786Updated last year
- Mirror of Apache griffin☆1,155Updated 3 months ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆635Updated 2 weeks ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,178Updated last week
- Apache Flink Training Excercises☆950Updated 8 months ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆553Updated 3 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆909Updated last week
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆72Updated 3 months ago
- ☆550Updated 3 years ago
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆946Updated 2 weeks ago
- Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch☆841Updated 5 years ago