A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!
☆22May 20, 2025Updated last year
Alternatives and similar repositories for docker-big-data-cluster
Users that are interested in docker-big-data-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for practical data science problems approaches, including notebook demo and working scripts | #DS | #analysis☆12Oct 13, 2020Updated 5 years ago
- AKS Course - Pluralsight☆10Oct 29, 2022Updated 3 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆14Jun 25, 2023Updated 2 years ago
- Starter Code for the Course 2 project of the Udacity ML DevOps Nanodegree Program☆22Jun 20, 2024Updated last year
- A Django + PyPDF2 application extracting PDF pages, merging and replacing PDF files online.☆18Nov 13, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆35Dec 9, 2024Updated last year
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆14Sep 2, 2022Updated 3 years ago
- Starting up a Kubernetes cluster with Vagrant, with Gluster, Portworx, Linstor, or StorageOS as storage provider and Traefik as ingress c…☆11May 25, 2022Updated 3 years ago
- ☆16Feb 18, 2025Updated last year
- sql engine for csv files☆16Nov 3, 2016Updated 9 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆169Feb 4, 2021Updated 5 years ago
- This repo is to create a full kubernetes cluster using k3d with MetalLb, prometheus, cert-manager, and traefik.☆12Mar 5, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Airflow DAGs and Scripts for Pushing Daily CSV dumps into Singer☆23May 8, 2017Updated 9 years ago
- Simple PCB for Wemos D1 Mini ESP8266 and an A4988 stepper driver☆11Sep 30, 2023Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 3 years ago
- Spark all the ETL Pipelines☆37Aug 2, 2023Updated 2 years ago
- CKAN Extensions☆12Aug 26, 2021Updated 4 years ago
- ☆21Jul 15, 2015Updated 10 years ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- From pocket to chrome bookmarks☆12Jan 7, 2017Updated 9 years ago
- Machine Learning Quick Reference, published by Packt☆17Jan 30, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A safe and convenient store for one value of each type☆11Apr 10, 2021Updated 5 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆52Dec 2, 2023Updated 2 years ago
- Mouse replacement software to use computers with your eyes with support of a compatible Tobii Eye Tracker.☆10Jul 5, 2020Updated 5 years ago
- An NTLM, NTLM2SR, and NTLMv2 authenticating HTTP proxy☆10Dec 5, 2015Updated 10 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- The OpenJur is an administrative Open Source system for lawyers and law firms of any size.☆15Jun 2, 2016Updated 9 years ago
- 将P站下载的动图压缩包转换为Gif图。☆10Nov 25, 2017Updated 8 years ago
- My Kubernetes Cluster (k3s) managed by GitOps (Flux2)☆17Nov 14, 2023Updated 2 years ago
- DBImport ingestion tool. Handle import, export and standard ETL flows in Hadoop/Hive☆19Feb 17, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Theo dõi biến động giá sản phẩm TIKI với Github Actions☆14Jan 16, 2022Updated 4 years ago
- Searchmonkey - Power searching without the pain. Perform powerful desktop searches using regular expressions. Graphical equivalent to fin…☆17May 1, 2026Updated 3 weeks ago
- ☆10Feb 28, 2020Updated 6 years ago
- Various text analytics tutorials☆13May 16, 2017Updated 9 years ago
- This is a Helm chart that lets you create a Kubernetes storage class for creating local persistent volumes for a local Kubernetes cluster…☆13Nov 10, 2019Updated 6 years ago
- This is the repo for the newmap.ai project: language and interpreter☆12Aug 4, 2024Updated last year
- A low code editor with the full power of flutter. created by @sanihaq for @flutter🌸☆11Aug 1, 2021Updated 4 years ago