Dockerizing an Apache Spark Standalone Cluster
☆42 · Jun 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for apache-spark-docker
Users who are interested in apache-spark-docker are comparing it to the repositories listed below.
- The goal of this project is to identify students at risk of dropping out of school ☆22 · May 7, 2021 · Updated 4 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on… ☆28 · Jun 13, 2022 · Updated 3 years ago
- Dockerizing and Consuming an Apache Livy environment ☆13 · Jun 29, 2022 · Updated 3 years ago
- Infrastructure for Big Data: Hadoop + NiFi + Spark + Hive using Docker ☆20 · Jan 5, 2026 · Updated last month
- Sample Project to Learn Data Engineering ☆10 · Aug 1, 2021 · Updated 4 years ago
- Lecture: Big Data ☆14 · Oct 27, 2025 · Updated 4 months ago
- ☆10 · Jun 3, 2023 · Updated 2 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT) ☆13 · Jun 13, 2022 · Updated 3 years ago
- Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary… ☆31 · Jul 6, 2021 · Updated 4 years ago
- CI/CD platform using Jenkins, Docker, Sonar, Nexus, JMeter, Selenium, Ansible, AWX, Grafana, Prometheus, Zabbix, Stress-ng ☆21 · Feb 5, 2026 · Updated 3 weeks ago
- 拉比克 is an open-source solution for building a big data platform, already running stably in production clusters. It integrates Hadoop, Hive, HBase, ZooKeeper, and others, similar to CDH ☆14 · Mar 11, 2019 · Updated 6 years ago
- A parallel implementation of the bzip2 data compressor in Python. This data compression pipeline uses algorithms like Burrows–Wheeler… ☆13 · Jun 29, 2022 · Updated 3 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit ☆20 · Aug 5, 2022 · Updated 3 years ago
- A Docker image with a pre-configured Hive Metastore and a Spark ThriftServer ☆19 · Jan 20, 2020 · Updated 6 years ago
- Zeppelin docker ☆16 · Nov 16, 2020 · Updated 5 years ago
- ☆21 · Jul 3, 2019 · Updated 6 years ago
- Challenge Data Engineer ☆25 · Jun 13, 2022 · Updated 3 years ago
- Cloud-native Trino (prestosql) + Hive + Minio + Superset ☆24 · Nov 29, 2021 · Updated 4 years ago
- Docker environment to stream data from Kafka to Iceberg tables ☆30 · Feb 27, 2024 · Updated 2 years ago
- Data sets and ML models versioning example from DVC get started ☆10 · Jun 4, 2024 · Updated last year
- ☆11 · May 28, 2025 · Updated 9 months ago
- Learn various Machine Learning algorithms like SVC, Decision Tree, Random Forest, Logistic Regression, Linear Regression and much Mo… ☆11 · Jul 31, 2019 · Updated 6 years ago
- ☆10 · Jun 21, 2021 · Updated 4 years ago
- PostgreSQL configured to work as a metastore for Hive. ☆32 · Dec 16, 2022 · Updated 3 years ago
- Natural Language Processing ☆11 · Jun 23, 2021 · Updated 4 years ago
- ☆14 · Sep 14, 2021 · Updated 4 years ago
- A simplified, generic, entity-based web library for golang that's drop-in compatible with net/http ☆10 · Jul 14, 2023 · Updated 2 years ago
- Database practical training platform (front-end project) ☆15 · Feb 27, 2023 · Updated 3 years ago
- PredictorFinc is a scalable supervised machine learning model that predicts stock price change through Decision Tree Regressor using data … ☆12 · Sep 5, 2023 · Updated 2 years ago
- Node-RED Flow (and web page example) for the LLaMA AI model ☆11 · Jul 27, 2023 · Updated 2 years ago
- Multi-container environment with Hadoop, Spark and Hive ☆232 · May 5, 2025 · Updated 9 months ago
- Just a boilerplate for PySpark and Flask ☆36 · Aug 2, 2018 · Updated 7 years ago
- DuckDB extension to read pcap files ☆48 · Sep 23, 2025 · Updated 5 months ago
- E-commerce GCP Streaming pipeline: Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin… ☆11 · Mar 9, 2022 · Updated 3 years ago
- A scraper built with Beautiful Soup 4 in Python, tailor-made for extracting news from moneycontrol.com. Issue pull request for different … ☆12 · Jun 21, 2020 · Updated 5 years ago
- A Python library to simplify batch requests to AWS Services ☆12 · Apr 25, 2020 · Updated 5 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code ☆14 · Nov 29, 2021 · Updated 4 years ago
- This repository will help you learn Databricks concepts with the help of examples. It will include all the important topics which… ☆104 · Sep 26, 2025 · Updated 5 months ago
- ☆38 · Feb 23, 2026 · Updated last week