A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!
☆23May 20, 2025Updated 10 months ago
Alternatives and similar repositories for docker-big-data-cluster
Users that are interested in docker-big-data-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆51Dec 7, 2020Updated 5 years ago
- Repo for practical data science problems approaches, including notebook demo and working scripts | #DS | #analysis☆12Oct 13, 2020Updated 5 years ago
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- AKS Course - Pluralsight☆10Oct 29, 2022Updated 3 years ago
- Repository for building docker image, with open-source applications☆26Apr 23, 2024Updated last year
- Machine Learning DevOps Engineer Nanodegree☆11Jan 27, 2022Updated 4 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆14Jun 25, 2023Updated 2 years ago
- 以慕课网日志分析为例 进入大数据 Spark SQL 的世界☆15Apr 3, 2018Updated 7 years ago
- A json version of the OpenCyc-latest.owl Ontology☆13Oct 27, 2011Updated 14 years ago
- OCR as a service☆15Dec 11, 2016Updated 9 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆36Dec 9, 2024Updated last year
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- ☆146Apr 21, 2022Updated 3 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆14Sep 2, 2022Updated 3 years ago
- Starting up a Kubernetes cluster with Vagrant, with Gluster, Portworx, Linstor, or StorageOS as storage provider and Traefik as ingress c…☆11May 25, 2022Updated 3 years ago
- A super-minimal Docker Compose template for Apache Superset.☆17Jan 31, 2024Updated 2 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Lessons, exercises and projects of Udacity Cloud Devops Nanodegree☆13Dec 30, 2022Updated 3 years ago
- ☆16Feb 18, 2025Updated last year
- This repo is to create a full kubernetes cluster using k3d with MetalLb, prometheus, cert-manager, and traefik.☆12Mar 5, 2026Updated 2 weeks ago
- A more pretty, more usable web dashboard for Apache Oozie, written in Scala.☆72May 6, 2013Updated 12 years ago
- A template to create CVs/Resumes with Quarto☆10Jul 17, 2023Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It is designed to crawl websites and extrac…☆13Apr 12, 2025Updated 11 months ago
- Purple CMS - Purple is Awesome☆18Jan 27, 2024Updated 2 years ago
- ☆21Jul 15, 2015Updated 10 years ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- Scala embedded universal probabilistic programming language☆11Apr 15, 2021Updated 4 years ago
- Exadel Activity-based Security Framework☆19Dec 10, 2022Updated 3 years ago
- An NTLM, NTLM2SR, and NTLMv2 authenticating HTTP proxy☆10Dec 5, 2015Updated 10 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆86Jan 2, 2025Updated last year
- streaming data pipeline platform☆29Jan 4, 2026Updated 2 months ago
- A group of examples based on the CSE pipleline.☆10May 13, 2013Updated 12 years ago
- 将P站下载的动图压缩包转换为Gif图。☆10Nov 25, 2017Updated 8 years ago
- Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)☆13Updated this week
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds☆11Oct 28, 2019Updated 6 years ago
- DBImport ingestion tool. Handle import, export and standard ETL flows in Hadoop/Hive☆19Feb 17, 2026Updated last month
- Theo dõi biến động giá sản phẩm TIKI với Github Actions☆14Jan 16, 2022Updated 4 years ago
- openMDX Documentation☆14Apr 16, 2025Updated 11 months ago