hgrif / oozie-pyspark-workflow
Example of an Oozie workflow with a PySpark action using Python eggs
☆14Updated 8 years ago
Alternatives and similar repositories for oozie-pyspark-workflow:
Users that are interested in oozie-pyspark-workflow are comparing it to the libraries listed below
- Simple Spark Application☆76Updated last year
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Code examples and docker environment for Spark☆27Updated 9 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Updated 11 years ago
- ☆24Updated 8 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- Self-contained examples using Apache Spark with the functional features of Java 8☆64Updated 7 years ago
- sample oozie workflows☆18Updated 7 years ago
- An example Apache Beam project.☆111Updated 7 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- ☆75Updated 4 years ago
- ☆81Updated last year
- These are some code examples☆55Updated 5 years ago
- Simple examle for Spark Streaming over Kafka topic☆106Updated 4 years ago
- ☆20Updated 7 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Assets used in Apress -- Scalable Big Data Architecture -- book☆19Updated 9 years ago
- ☆14Updated 9 years ago
- ☆49Updated 5 years ago
- Python client for Spark Jobserver Rest API☆39Updated 5 years ago
- Fixed and updated code examples from the book "Apache Kafka"☆74Updated 8 years ago
- Pipeline library for StreamSets Data Collector and Transformer☆33Updated 2 years ago
- ☆35Updated 8 years ago
- ☆31Updated 5 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js☆50Updated last year
- Example Maven configuration for a Spark, Scala project☆54Updated 3 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 9 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Updated 5 years ago