big-data-europe / docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
☆687Updated 3 years ago
Related projects: ⓘ
- Apache Spark docker image☆2,034Updated last year
- Docker build for Apache Spark☆676Updated 2 years ago
- ☆1,016Updated 3 months ago
- Apache Hadoop docker image☆2,183Updated 7 months ago
- The Internals of Spark Structured Streaming☆415Updated last year
- A simple spark standalone cluster for your testing environment purposses☆548Updated 6 months ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆552Updated 3 years ago
- ☆246Updated last year
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Updated last year
- Examples for High Performance Spark☆497Updated 3 weeks ago
- Scala examples for learning to use Spark☆444Updated 4 years ago
- The Internals of Apache Spark☆1,461Updated this week
- ☆765Updated 3 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆880Updated this week
- Apache Flink docker image☆190Updated 2 years ago
- Hadoop docker image☆1,212Updated 4 years ago
- ☆235Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆581Updated 7 months ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆158Updated 3 years ago
- HBase running in Docker☆328Updated last year
- Qubole Sparklens tool for performance tuning Apache Spark☆561Updated 2 months ago
- The Internals of Spark SQL☆447Updated last month
- Jupyter magics and kernels for working with remote Spark clusters☆1,315Updated last month
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆608Updated last week
- Kafka Connect HDFS connector☆7Updated last week
- The MongoDB Spark Connector☆708Updated 3 weeks ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆692Updated last month
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)☆727Updated last week
- Mirror of Apache Toree (Incubating)☆737Updated 2 weeks ago
- StreamSets Tutorials☆345Updated last month