A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!
☆22May 20, 2025Updated last year
Alternatives and similar repositories for docker-big-data-cluster
Users that are interested in docker-big-data-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run Hadoop Cluster within Docker Containers.☆16Mar 6, 2025Updated last year
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆50Dec 7, 2020Updated 5 years ago
- OCR as a service☆17Dec 11, 2016Updated 9 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆35Dec 9, 2024Updated last year
- Hive Storage Handler for SOLR☆16Mar 17, 2014Updated 12 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A python library allowing to import and export metabase database configuration from a metabase API☆22Aug 2, 2024Updated last year
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆14Sep 2, 2022Updated 3 years ago
- Starting up a Kubernetes cluster with Vagrant, with Gluster, Portworx, Linstor, or StorageOS as storage provider and Traefik as ingress c…☆11May 25, 2022Updated 4 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Consume a stream of data into a binary Buffer as efficiently as possible☆12Jun 21, 2018Updated 8 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- This repo is to create a full kubernetes cluster using k3d with MetalLb, prometheus, cert-manager, and traefik.☆12Mar 5, 2026Updated 3 months ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Airflow DAGs and Scripts for Pushing Daily CSV dumps into Singer☆23May 8, 2017Updated 9 years ago
- A more pretty, more usable web dashboard for Apache Oozie, written in Scala.☆72May 6, 2013Updated 13 years ago
- Simple PCB for Wemos D1 Mini ESP8266 and an A4988 stepper driver☆11Sep 30, 2023Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 3 years ago
- Loops in Oozie☆10Feb 15, 2015Updated 11 years ago
- Automate DNS for your (kube) VIPs☆13Aug 29, 2021Updated 4 years ago
- ☆20Jul 15, 2015Updated 10 years ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- From pocket to chrome bookmarks☆12Jan 7, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Power BI REST API function wrappers for sending Spark data to Power BI Push Datasets☆15Apr 22, 2019Updated 7 years ago
- Visualization tool for the Oozie workflows.☆20Feb 7, 2013Updated 13 years ago
- Mouse replacement software to use computers with your eyes with support of a compatible Tobii Eye Tracker.☆10Jul 5, 2020Updated 5 years ago
- Scala embedded universal probabilistic programming language☆11Apr 15, 2021Updated 5 years ago
- An NTLM, NTLM2SR, and NTLMv2 authenticating HTTP proxy☆10Dec 5, 2015Updated 10 years ago
- A python script to print battery charge level of some bluetooth headsets☆15Jan 13, 2020Updated 6 years ago
- The OpenJur is an administrative Open Source system for lawyers and law firms of any size.☆15Jun 2, 2016Updated 10 years ago
- JSON Serde for Hive☆21Oct 13, 2011Updated 14 years ago
- DBImport ingestion tool. Handle import, export and standard ETL flows in Hadoop/Hive☆19Feb 17, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Theo dõi biến động giá sản phẩm TIKI với Github Actions☆14Jan 16, 2022Updated 4 years ago
- ☆90Sep 24, 2022Updated 3 years ago
- ☆10Feb 28, 2020Updated 6 years ago
- XMPP Plugin for http://meetfranz.com☆11Mar 25, 2019Updated 7 years ago
- The 411 on basic guerrilla grafting skills. Now in Spanish!☆12Jun 14, 2018Updated 8 years ago
- Various text analytics tutorials☆13May 16, 2017Updated 9 years ago
- This is a Helm chart that lets you create a Kubernetes storage class for creating local persistent volumes for a local Kubernetes cluster…☆13Nov 10, 2019Updated 6 years ago