marcelmittelstaedt / BigData
Lecture: Big Data
☆15Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for BigData
- Infraestructura para Big Data : Hadoop + NiFi +Spark + Hive usando Docker☆19Updated last year
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Updated last year
- Run Hadoop Cluster within Docker Containers.☆16Updated 2 months ago
- Starting up a Kubernetes cluster with Vagrant, with Gluster, Portworx, Linstor, or StorageOS as storage provider and Traefik as ingress c…☆11Updated 2 years ago
- Zeppelin docker☆15Updated 4 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary…☆28Updated 3 years ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆11Updated 3 years ago
- This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.☆51Updated 6 years ago
- Run Hadoop Custer within Docker Containers☆25Updated 4 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆40Updated 11 months ago
- Infrastructure automation to deploy Hadoop,Hive,Spark,airflow nodes on a docker host☆20Updated 5 years ago
- A consumer of a Kafka topic based on Flink☆12Updated 2 years ago
- Spark + Jupyer + Hive☆16Updated 9 years ago
- Hadoop, MapReduce, HDFS, Spark, Pig, Hive, HBase, MongoDB, Cassandra, Flume - the list goes on! Over 25 technologies.☆10Updated 6 years ago
- Dockerfiles and Docker Compose for HDP 2.6 with Blueprints☆23Updated 6 years ago
- Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink☆28Updated last year
- Cloud-native Trino (prestosql) + Hive + Minio + Superset☆21Updated 2 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆61Updated 5 months ago
- ☆12Updated 2 years ago
- Springboot + ElasticSearch 构建博客检索系统☆12Updated 4 years ago
- Ansible roles to install an Spark Standalone cluster (HDFS/Spark/Jupyter Notebook) or Ambari based Spark cluster☆62Updated 9 months ago
- A dockerized small bigdata cluster to play with☆13Updated 8 years ago
- Spring Boot Crud Application with Unit Testing using JUnit & Mockito☆15Updated 3 years ago
- Big Data Docker Data Science Spark Spark3 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook☆16Updated this week
- Conduct a Report and Analysis on 200,000 sales data points to answer revenue-related questions for the business☆22Updated 3 years ago
- Hadoop, Hive, Parquet and Hue in docker-compose v3☆40Updated 4 years ago
- Spark Streaming HBase Example☆22Updated 8 years ago