A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!
☆23May 20, 2025Updated 10 months ago
Alternatives and similar repositories for docker-big-data-cluster
Users that are interested in docker-big-data-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run Hadoop Cluster within Docker Containers.☆16Mar 6, 2025Updated last year
- Statistical Analysis: Project with A/B testing and Machine Learning methodologies☆16Jan 9, 2019Updated 7 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆36Dec 9, 2024Updated last year
- Hive Storage Handler for SOLR☆16Mar 17, 2014Updated 12 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆147Apr 21, 2022Updated 3 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- A super-minimal Docker Compose template for Apache Superset.☆18Jan 31, 2024Updated 2 years ago
- Consume a stream of data into a binary Buffer as efficiently as possible☆12Jun 21, 2018Updated 7 years ago
- sql engine for csv files☆16Nov 3, 2016Updated 9 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- This repo is to create a full kubernetes cluster using k3d with MetalLb, prometheus, cert-manager, and traefik.☆12Mar 5, 2026Updated last month
- A more pretty, more usable web dashboard for Apache Oozie, written in Scala.☆72May 6, 2013Updated 12 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Rust HAL repp☆12Apr 25, 2022Updated 3 years ago
- Simple PCB for Wemos D1 Mini ESP8266 and an A4988 stepper driver☆11Sep 30, 2023Updated 2 years ago
- A template to create CVs/Resumes with Quarto☆10Jul 17, 2023Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Provides reusable MSBuild tasks and sample Visual Studio tooling for building and debugging Mono AOT compiled binaries☆14Jan 17, 2020Updated 6 years ago
- Loops in Oozie☆10Feb 15, 2015Updated 11 years ago
- Foodmart data set in hsqldb format☆26Oct 19, 2025Updated 5 months ago
- From pocket to chrome bookmarks☆12Jan 7, 2017Updated 9 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆52Dec 2, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Visualization tool for the Oozie workflows.☆20Feb 7, 2013Updated 13 years ago
- Scala embedded universal probabilistic programming language☆11Apr 15, 2021Updated 4 years ago
- Mouse replacement software to use computers with your eyes with support of a compatible Tobii Eye Tracker.☆10Jul 5, 2020Updated 5 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆86Jan 2, 2025Updated last year
- A group of examples based on the CSE pipleline.☆10May 13, 2013Updated 12 years ago
- The OpenJur is an administrative Open Source system for lawyers and law firms of any size.☆15Jun 2, 2016Updated 9 years ago
- Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)☆13Updated this week
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds☆11Oct 28, 2019Updated 6 years ago
- My Kubernetes Cluster (k3s) managed by GitOps (Flux2)☆17Nov 14, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Theo dõi biến động giá sản phẩm TIKI với Github Actions☆14Jan 16, 2022Updated 4 years ago
- Searchmonkey - Power searching without the pain. Perform powerful desktop searches using regular expressions. Graphical equivalent to fin…☆17Mar 27, 2018Updated 8 years ago
- AI-powered code review assistant for GitHub Pull Requests using OpenAI GPT-4 and Claude with automated feedback and analytics dashboard.☆23Dec 13, 2025Updated 3 months ago
- XMPP Plugin for http://meetfranz.com☆11Mar 25, 2019Updated 7 years ago
- Learn to use postgrest by following http://blog.jonharrington.org/postgrest-introduction/☆16Jul 22, 2015Updated 10 years ago
- This is a Helm chart that lets you create a Kubernetes storage class for creating local persistent volumes for a local Kubernetes cluster…☆13Nov 10, 2019Updated 6 years ago
- This is the repo for the newmap.ai project: language and interpreter☆12Aug 4, 2024Updated last year