mashin-io / oep
The public repo for Oozie Editor Plugin.
☆16Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for oep
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…☆344Updated 7 years ago
- Utility to easily copy files into HDFS☆69Updated 4 years ago
- Remedy small files by combining them into larger ones.☆193Updated 2 years ago
- Hive SerDe for CSV☆140Updated 3 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆166Updated 9 years ago
- Example programs and scripts for accessing parquet files☆30Updated 6 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated 10 months ago
- Apache Spark and Apache Kafka integration example☆124Updated 6 years ago
- ☆245Updated 6 years ago
- A library to expose more of Apache Spark's metrics system☆146Updated 4 years ago
- Oozie Samples☆51Updated 10 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 7 years ago
- Apache Spark applications☆70Updated 6 years ago
- spark + drools☆101Updated 2 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆219Updated 8 years ago
- A tool for monitoring and tuning Spark jobs for efficiency.☆357Updated 2 years ago
- Write your Spark data to Kafka seamlessly☆176Updated 4 months ago
- Kafka as Hive Storage☆67Updated 10 years ago
- Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.☆445Updated 10 months ago
- NexR Hive UDFs☆111Updated 9 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- ☆54Updated 10 years ago
- SparkOnHBase☆279Updated 3 years ago
- Examples for Apache Oozie book☆18Updated 8 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆86Updated 8 months ago