big-data-europe / docker-spark
Apache Spark docker image
☆2,050 Updated last year
Alternatives and similar repositories for docker-spark:
Users who are interested in docker-spark are comparing it to the repositories listed below.
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a… ☆692 Updated 4 years ago
- ☆1,042 Updated 9 months ago
- Apache Hadoop docker image ☆2,240 Updated last year
- Docker build for Apache Spark ☆673 Updated 3 years ago
- A simple spark standalone cluster for your testing environment purposes ☆568 Updated last year
- Apache Flink docker image ☆193 Updated 2 years ago
- ☆250 Updated 2 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,346 Updated 3 weeks ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,005 Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ☆484 Updated 2 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies… ☆1,111 Updated 2 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆163 Updated 4 years ago
- Python interface to Hive and Presto. 🐝 ☆1,678 Updated 7 months ago
- REST job server for Apache Spark ☆2,836 Updated 2 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆905 Updated 4 months ago
- Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond ☆1,912 Updated last week
- ETL best practices with airflow, with examples ☆1,321 Updated 6 months ago
- Mirror of Apache griffin ☆1,152 Updated 2 months ago
- A connector for Spark that allows reading and writing to/from Redis cluster ☆946 Updated 5 months ago
- Run Hadoop Cluster within Docker Containers ☆1,808 Updated 8 months ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆553 Updated 3 years ago
- Multi-container environment with Hadoop, Spark and Hive ☆210 Updated last year
- The Internals of Apache Spark ☆1,491 Updated 6 months ago
- A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io. ☆1,019 Updated this week
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs ☆1,032 Updated this week
- Hadoop docker image ☆1,211 Updated 4 years ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop ☆1,935 Updated this week
- Examples for High Performance Spark ☆506 Updated 4 months ago
- The Internals of Spark SQL ☆463 Updated 2 months ago
- MLeap: Deploy ML Pipelines to Production ☆1,515 Updated 3 months ago