mirkoprescha / spark-zeppelin-docker
docker image with spark and zeppelin
☆12Updated 5 years ago
Alternatives and similar repositories for spark-zeppelin-docker:
Users that are interested in spark-zeppelin-docker are comparing it to the libraries listed below
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- JSON schema parser for Apache Spark☆81Updated 2 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆50Updated last year
- Real-world Spark pipelines examples☆83Updated 7 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Updated 5 years ago
- Asynchronous actions for PySpark☆47Updated 3 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 5 years ago
- ☆72Updated 4 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- Parcel for Apache Airflow☆17Updated 5 years ago
- Kite SDK Examples☆99Updated 3 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Updated 7 years ago
- ☆41Updated 7 years ago
- Apache Spark docker container image (Standalone mode)☆35Updated 4 years ago
- Nested array transformation helper extensions for Apache Spark☆37Updated last year
- A sample implementation of the Spark Datasource API☆23Updated 7 years ago
- Examples of Spark 3.0☆47Updated 4 years ago
- Demonstration code for MLeap, both Jupyter notebooks and projects☆24Updated 5 years ago
- Avro record class and reader generator☆20Updated 2 years ago
- Wikipedia stream-processing demo using Kafka Connect and Kafka Streams.☆75Updated 7 years ago
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago