suhothayan / hadoop-spark-pig-hive
Docker with Hadoop, Spark, Pig, and Hive
☆24 Updated 5 years ago
Related projects
Alternatives and complementary repositories for hadoop-spark-pig-hive
- Docker setup running Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop) ☆11 Updated 3 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆67 Updated 3 years ago
- Infrastructure automation to deploy Hadoop, Hive, Spark, and Airflow nodes on a Docker host ☆20 Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆161 Updated 3 years ago
- Apache Spark Structured Streaming with Kafka using Python (PySpark) ☆41 Updated 5 years ago
- Repo for all my code on the articles I post on Medium ☆105 Updated 2 years ago
- Hadoop, Hive, Parquet and Hue in docker-compose v3 ☆40 Updated 4 years ago
- Docker-compose setup containing the most common big data systems, such as Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink ☆27 Updated last year
- Hadoop-Hive-Spark cluster + Jupyter on Docker ☆61 Updated 5 months ago
- Deploy your Spark Production Cluster on Kubernetes ☆47 Updated 4 years ago
- One-click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager) ☆119 Updated 3 years ago
- Spark on Kubernetes ☆105 Updated last year
- Multi-container environment with Hadoop, Spark and Hive ☆202 Updated 10 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆53 Updated last year
- Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary… ☆28 Updated 3 years ago
- A demo of using Kafka, Spark, Hive, Cassandra, etc. with Docker. It produces the production ready environment for any kinds of big d… ☆31 Updated 5 years ago
- Hands-On Big Data Analytics with PySpark, Published by Packt ☆34 Updated last year
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 5 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆39 Updated 3 years ago
- Spark Streaming HBase Example ☆22 Updated 8 years ago
- A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems ☆42 Updated last year
- Set up a 3-node Spark cluster using Docker containers ☆33 Updated 6 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S. nationwide temperature from IoT sensors in real time ☆67 Updated 7 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 Updated 5 years ago
- Infrastructure for Big Data: Hadoop + NiFi + Spark + Hive using Docker ☆19 Updated last year
- ☆105 Updated 4 years ago
- Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code. ☆15 Updated 5 years ago
- Credit card fraud detection ☆20 Updated 2 years ago
- Spark on Kubernetes using Helm ☆34 Updated 4 years ago
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow' ☆119 Updated last year