big-data-europe / docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
☆692 · Updated 4 years ago
Alternatives and similar repositories for docker-hadoop-spark-workbench:
Users who are interested in docker-hadoop-spark-workbench compare it to the repositories listed below.
- Apache Spark docker image ☆2,054 · Updated 2 years ago
- Docker build for Apache Spark ☆673 · Updated 3 years ago
- ☆250 · Updated 2 years ago
- Apache Hadoop docker image ☆2,253 · Updated last year
- Apache Flink docker image ☆193 · Updated 2 years ago
- ☆764 · Updated 4 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆553 · Updated 3 years ago
- The Internals of Spark Structured Streaming ☆418 · Updated 2 years ago
- Hadoop docker image ☆1,211 · Updated 4 years ago
- ☆1,050 · Updated 10 months ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 · Updated 2 years ago
- Scala examples for learning to use Spark ☆445 · Updated 4 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆164 · Updated 4 years ago
- A connector for Spark that allows reading and writing to/from Redis cluster ☆947 · Updated 6 months ago
- HBase running in Docker ☆331 · Updated 2 years ago
- Examples for High Performance Spark ☆508 · Updated 5 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆909 · Updated last week
- A simple Spark standalone cluster for your testing environment purposes ☆571 · Updated last year
- Docker image with Ambari ☆290 · Updated 7 years ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE ☆292 · Updated 2 years ago
- A simplified, lightweight ETL Framework based on Apache Spark ☆585 · Updated last year
- Spark Structured Streaming / Kafka / Cassandra / Elastic ☆183 · Updated 2 years ago
- The Internals of Spark SQL ☆465 · Updated 3 months ago
- Run Hadoop Cluster within Docker Containers ☆1,813 · Updated 9 months ago
- A Spark plugin for reading and writing Excel files ☆490 · Updated last week
- Connect Spark to HBase for reading and writing data with ease ☆297 · Updated 7 years ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆575 · Updated 9 months ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka. ☆199 · Updated 7 years ago
- ☆550 · Updated 3 years ago
- A tool for monitoring and tuning Spark jobs for efficiency. ☆357 · Updated 2 years ago