A ready-to-go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!
☆22 · May 20, 2025 · Updated last year
Alternatives and similar repositories for docker-big-data-cluster
Users that are interested in docker-big-data-cluster are comparing it to the libraries listed below.
- Run Hadoop Cluster within Docker Containers. ☆16 · Mar 6, 2025 · Updated last year
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn ☆50 · Dec 7, 2020 · Updated 5 years ago
- Udacity Data Engineering Nanodegree Project 3 ☆12 · Jul 14, 2019 · Updated 6 years ago
- AKS Course - Pluralsight ☆10 · Oct 29, 2022 · Updated 3 years ago
- Repository for building docker image, with open-source applications ☆26 · Apr 23, 2024 · Updated 2 years ago
- Machine Learning DevOps Engineer Nanodegree ☆11 · Jan 27, 2022 · Updated 4 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste… ☆13 · Jun 25, 2023 · Updated 2 years ago
- Starter Code for the Course 2 project of the Udacity ML DevOps Nanodegree Program ☆22 · Jun 20, 2024 · Updated last year
- A json version of the OpenCyc-latest.owl Ontology ☆13 · Oct 27, 2011 · Updated 14 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, … ☆35 · Dec 9, 2024 · Updated last year
- Hive Storage Handler for SOLR ☆16 · Mar 17, 2014 · Updated 12 years ago
- ☆144 · Apr 21, 2022 · Updated 4 years ago
- A work-in-progress book on Dask ☆12 · Jul 15, 2023 · Updated 2 years ago
- Lessons, exercises and projects of Udacity Cloud Devops Nanodegree ☆14 · Dec 30, 2022 · Updated 3 years ago
- ☆16 · Feb 18, 2025 · Updated last year
- sql engine for csv files ☆16 · Nov 3, 2016 · Updated 9 years ago
- Example repo for web scraping with Sveltekit API routes, Puppeteer, and Vercel Blob Storage ☆12 · May 7, 2024 · Updated 2 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆169 · Feb 4, 2021 · Updated 5 years ago
- Simple PCB for Wemos D1 Mini ESP8266 and an A4988 stepper driver ☆11 · Sep 30, 2023 · Updated 2 years ago
- Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It is designed to crawl websites and extrac… ☆13 · Apr 12, 2025 · Updated last year
- Provides reusable MSBuild tasks and sample Visual Studio tooling for building and debugging Mono AOT compiled binaries ☆14 · Jan 17, 2020 · Updated 6 years ago
- Purple CMS - Purple is Awesome ☆18 · Jan 27, 2024 · Updated 2 years ago
- https://aka.ms/lakehouselab ☆23 · Feb 14, 2023 · Updated 3 years ago
- From pocket to chrome bookmarks ☆12 · Jan 7, 2017 · Updated 9 years ago
- Machine Learning Quick Reference, published by Packt ☆17 · Jan 30, 2023 · Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data ☆51 · Dec 2, 2023 · Updated 2 years ago
- Mouse replacement software to use computers with your eyes with support of a compatible Tobii Eye Tracker. ☆10 · Jul 5, 2020 · Updated 5 years ago
- Scala embedded universal probabilistic programming language ☆11 · Apr 15, 2021 · Updated 5 years ago
- An NTLM, NTLM2SR, and NTLMv2 authenticating HTTP proxy ☆10 · Dec 5, 2015 · Updated 10 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker ☆84 · Jan 2, 2025 · Updated last year
- A group of examples based on the CSE pipeline. ☆10 · May 13, 2013 · Updated 13 years ago
- streaming data pipeline platform ☆30 · Jun 3, 2026 · Updated last week
- The OpenJur is an administrative Open Source system for lawyers and law firms of any size. ☆15 · Jun 2, 2016 · Updated 10 years ago
- Converts animated-image zip archives downloaded from Pixiv into GIFs. ☆10 · Nov 25, 2017 · Updated 8 years ago
- My Kubernetes Cluster (k3s) managed by GitOps (Flux2) ☆17 · Nov 14, 2023 · Updated 2 years ago
- JSON Serde for Hive ☆21 · Oct 13, 2011 · Updated 14 years ago
- Track TIKI product price changes with GitHub Actions ☆14 · Jan 16, 2022 · Updated 4 years ago
- ☆20 · May 9, 2023 · Updated 3 years ago
- openMDX Documentation ☆14 · Apr 16, 2025 · Updated last year