suhothayan / hadoop-spark-pig-hive
Docker with hadoop spark pig hive
☆24Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for hadoop-spark-pig-hive
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆61Updated 5 months ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆161Updated 3 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆119Updated 3 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆67Updated 3 years ago
- Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink☆28Updated last year
- Code examples on Apache Spark using python☆105Updated 2 years ago
- Infrastructure automation to deploy Hadoop,Hive,Spark,airflow nodes on a docker host☆20Updated 5 years ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Updated 4 years ago
- Kafka streaming with Spark and Flink example☆30Updated last year
- Quickly set up a POC environment for Kafka+Spark☆16Updated 7 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆60Updated 6 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- Hands-On Big Data Analytics with PySpark, Published by Packt☆34Updated last year
- Run Hadoop Cluster within Docker Containers.☆16Updated 2 months ago
- ☆90Updated 2 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆40Updated 11 months ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆11Updated 3 years ago
- PySpark Cookbook, published by Packt☆89Updated last year
- Multi-container environment with Hadoop, Spark and Hive☆203Updated 10 months ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- A consumer of a Kafka topic based on Flink☆12Updated 2 years ago
- The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big d…☆31Updated 5 years ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆42Updated last year
- Spark Examples☆124Updated 2 years ago
- A Basic Flink Application Consuming & Aggregating Kafka Messages☆10Updated 5 years ago
- A simple spark standalone cluster for your testing environment purposses☆558Updated 8 months ago
- A Spark cluster setup running on Docker containers☆60Updated 4 years ago
- Repo for all my code on the articles I post on medium☆105Updated 2 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆41Updated 5 years ago
- Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.☆15Updated 5 years ago