A ready-to-go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!
☆23May 20, 2025Updated 11 months ago
Alternatives and similar repositories for docker-big-data-cluster
Users who are interested in docker-big-data-cluster are comparing it to the libraries listed below.
- Run Hadoop Cluster within Docker Containers.☆16Mar 6, 2025Updated last year
- The viadee Process Warehouse - Explore what happens in your BPMN processes☆13Dec 19, 2023Updated 2 years ago
- A json version of the OpenCyc-latest.owl Ontology☆13Oct 27, 2011Updated 14 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆36Dec 9, 2024Updated last year
- Hive Storage Handler for SOLR☆16Mar 17, 2014Updated 12 years ago
- A Python library for importing and exporting Metabase database configuration through the Metabase API☆22Aug 2, 2024Updated last year
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆14Sep 2, 2022Updated 3 years ago
- An introductory video AI tutorial: video motion detection☆17Oct 13, 2020Updated 5 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- A super-minimal Docker Compose template for Apache Superset.☆18Jan 31, 2024Updated 2 years ago
- Consume a stream of data into a binary Buffer as efficiently as possible☆12Jun 21, 2018Updated 7 years ago
- ☆16Feb 18, 2025Updated last year
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- This repo is to create a full kubernetes cluster using k3d with MetalLb, prometheus, cert-manager, and traefik.☆12Mar 5, 2026Updated last month
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago
- Airflow DAGs and Scripts for Pushing Daily CSV dumps into Singer☆23May 8, 2017Updated 8 years ago
- A template to create CVs/Resumes with Quarto☆10Jul 17, 2023Updated 2 years ago
- Purple CMS - Purple is Awesome☆18Jan 27, 2024Updated 2 years ago
- Automate DNS for your (kube) VIPs☆13Aug 29, 2021Updated 4 years ago
- Spark all the ETL Pipelines☆37Aug 2, 2023Updated 2 years ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- From pocket to chrome bookmarks☆12Jan 7, 2017Updated 9 years ago
- Machine Learning Quick Reference, published by Packt☆17Jan 30, 2023Updated 3 years ago
- Scala embedded universal probabilistic programming language☆11Apr 15, 2021Updated 5 years ago
- Mouse replacement software to use computers with your eyes with support of a compatible Tobii Eye Tracker.☆10Jul 5, 2020Updated 5 years ago
- An NTLM, NTLM2SR, and NTLMv2 authenticating HTTP proxy☆10Dec 5, 2015Updated 10 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆86Jan 2, 2025Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- streaming data pipeline platform☆30Jan 4, 2026Updated 3 months ago
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds☆11Oct 28, 2019Updated 6 years ago
- My Kubernetes Cluster (k3s) managed by GitOps (Flux2)☆17Nov 14, 2023Updated 2 years ago
- JSON Serde for Hive☆21Oct 13, 2011Updated 14 years ago
- DBImport ingestion tool. Handle import, export and standard ETL flows in Hadoop/Hive☆19Feb 17, 2026Updated 2 months ago
- Track TIKI product price changes with GitHub Actions☆14Jan 16, 2022Updated 4 years ago
- ☆90Sep 24, 2022Updated 3 years ago
- This is a Helm chart that lets you create a Kubernetes storage class for creating local persistent volumes for a local Kubernetes cluster…☆13Nov 10, 2019Updated 6 years ago
- An example for using the Rust language to write Spark UDFs☆13Jun 22, 2021Updated 4 years ago
- CI/CD platform using Jenkins, docker, Sonar, Nexus, Jmeter, Selenium, Ansible, AWX, Grafana, Prometheus, Zabbix, Stress-ng☆21Apr 26, 2026Updated last week