Examples for Apache Oozie book
☆18May 30, 2016Updated 9 years ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- Tool for visualizing Apache Oozie pipelines☆12Feb 15, 2016Updated 10 years ago
- Applied Data Science by Geoffrey Link☆10Feb 13, 2026Updated last month
- Simple Spark example of generating table stats for use of data quality checks☆28Apr 28, 2017Updated 8 years ago
- ☆44Jul 24, 2017Updated 8 years ago
- In this very simple Docker Swarm Demo we create Docker hosts with Docker Machine and install after this a small Elasticsearch cluster.☆12Jul 31, 2016Updated 9 years ago
- Node utility for captioning images via imageMagick☆12Aug 13, 2015Updated 10 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38May 14, 2019Updated 6 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 9 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆36Mar 9, 2026Updated last week
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- An example PySpark project with pytest☆18Oct 13, 2017Updated 8 years ago
- An automated, opinionated way to deploy to AWS Lambda☆13Mar 22, 2018Updated 8 years ago
- HBase.MCC (HBase Multi Cluster Client). The goal is to support aways up solutions with HBase through multiple clusters☆14Nov 9, 2015Updated 10 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- Open Network Insight Documents - this is a repository for images and collateral. Visit the wiki at https://github.com/Open-Network-Insi…☆10Sep 21, 2016Updated 9 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Jul 27, 2016Updated 9 years ago
- A template for starting a scala project with sbt☆35May 16, 2023Updated 2 years ago
- Programming Hive读书笔记☆12May 29, 2014Updated 11 years ago
- Configures and builds a database for engagement events generated by Amazon Simple Email Service (SES) and Amazon Pinpoint engagements usi…☆13Jan 16, 2025Updated last year
- ☆18Sep 7, 2014Updated 11 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 2 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Aug 16, 2019Updated 6 years ago
- Mapless is a small framework for storing objects in a key->data fashion (i.e.: noSQL databases) without requiring any kind of object-data…☆10Feb 14, 2020Updated 6 years ago
- Demo code for the Timely Security Analytics and Analysis 2015 Re:Invent presentation.☆29Jan 30, 2020Updated 6 years ago
- An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework☆13Feb 5, 2016Updated 10 years ago
- ☆15Jan 17, 2022Updated 4 years ago
- ☆18May 1, 2016Updated 9 years ago
- Trident state implementation for Redis☆29Dec 18, 2023Updated 2 years ago
- PGT allows you to generate pcaps using python without touching the network in any way. It is dependent upon scapy.☆29Jan 3, 2022Updated 4 years ago
- The ElasticSearch View Plugin provides a simple way to render ElasticSearch documents in HTML, XML or text☆48Mar 3, 2013Updated 13 years ago
- Algorithms and Data Structures implemented in Java☆12Jul 28, 2019Updated 6 years ago
- Layout & typography for LaTeX books using the memoir document class☆10Aug 22, 2024Updated last year
- Ansible role for provisioning EC2 instances☆20Aug 27, 2015Updated 10 years ago
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago