Examples for Apache Oozie book
☆18May 30, 2016Updated 9 years ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- Tool for visualizing Apache Oozie pipelines☆12Feb 15, 2016Updated 10 years ago
- Applied Data Science by Geoffrey Link☆10Feb 13, 2026Updated last month
- Simple Spark example of generating table stats for use of data quality checks☆28Apr 28, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆44Jul 24, 2017Updated 8 years ago
- Instant search for Sphinx☆14Apr 5, 2023Updated 3 years ago
- In this very simple Docker Swarm Demo we create Docker hosts with Docker Machine and install after this a small Elasticsearch cluster.☆12Jul 31, 2016Updated 9 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Feb 10, 2016Updated 10 years ago
- Node utility for captioning images via imageMagick☆12Aug 13, 2015Updated 10 years ago
- Distributed Web Crawler, Parser and Search Engine.☆10Jun 16, 2016Updated 9 years ago
- File compaction tool that runs on top of the Spark framework.☆59Apr 17, 2019Updated 6 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38May 14, 2019Updated 6 years ago
- ammonite vi mode ivy☆10Dec 1, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- Example of using pyinotify to emulate GNU tail -F (follows rotated log file)☆22May 25, 2012Updated 13 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆36Mar 20, 2026Updated 3 weeks ago
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- An example PySpark project with pytest☆18Oct 13, 2017Updated 8 years ago
- An automated, opinionated way to deploy to AWS Lambda☆13Mar 22, 2018Updated 8 years ago
- ☆11Mar 29, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- HBase.MCC (HBase Multi Cluster Client). The goal is to support aways up solutions with HBase through multiple clusters☆14Nov 9, 2015Updated 10 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 3 years ago
- Segment's bundled integration for Firebase on iOS☆13Mar 26, 2024Updated 2 years ago
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Jul 27, 2016Updated 9 years ago
- A template for starting a scala project with sbt☆35May 16, 2023Updated 2 years ago
- Programming Hive读书笔记☆12May 29, 2014Updated 11 years ago
- ☆18Sep 7, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Apache Storm 0.9.3-rc1 Docker cluster deployed on Apache Mesos with Marathon.☆11Jan 5, 2015Updated 11 years ago
- w3m config files☆16Oct 10, 2010Updated 15 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 2 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Aug 16, 2019Updated 6 years ago
- Cloudera Manager CM API Python end-to-end example☆15Aug 29, 2019Updated 6 years ago
- Tools for Hadoop☆24Feb 27, 2012Updated 14 years ago