Examples for Apache Oozie book
☆18 · May 30, 2016 · Updated 9 years ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below.
- sample oozie workflows ☆17 · Jun 13, 2017 · Updated 8 years ago
- Oozie Samples ☆51 · Jan 11, 2014 · Updated 12 years ago
- Building a Spring, Hibernate, Struts2 web project with Maven ☆20 · Jun 9, 2016 · Updated 9 years ago
- Applied Data Science by Geoffrey Link ☆10 · Feb 13, 2026 · Updated 2 months ago
- Simple Spark example of generating table stats for use in data quality checks ☆27 · Apr 28, 2017 · Updated 9 years ago
- ☆44 · Jul 24, 2017 · Updated 8 years ago
- In this very simple Docker Swarm demo we create Docker hosts with Docker Machine and then install a small Elasticsearch cluster. ☆12 · Jul 31, 2016 · Updated 9 years ago
- S3 backed ContentsManager for jupyter notebooks ☆14 · Feb 10, 2016 · Updated 10 years ago
- Node utility for captioning images via imageMagick ☆12 · Aug 13, 2015 · Updated 10 years ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 · Jun 16, 2016 · Updated 9 years ago
- File compaction tool that runs on top of the Spark framework. ☆59 · Apr 17, 2019 · Updated 7 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python. ☆37 · May 14, 2019 · Updated 6 years ago
- An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase ☆14 · Mar 23, 2016 · Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests ☆16 · Dec 29, 2018 · Updated 7 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ… ☆14 · Dec 25, 2024 · Updated last year
- Example of using pyinotify to emulate GNU tail -F (follows rotated log file) ☆22 · May 25, 2012 · Updated 13 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distributed Filesystem (HDFS) simpler and more intu… ☆37 · Mar 20, 2026 · Updated last month
- Ansible scripts for deploying Kafka on EC2 ☆10 · Oct 7, 2016 · Updated 9 years ago
- An automated, opinionated way to deploy to AWS Lambda ☆13 · Mar 22, 2018 · Updated 8 years ago
- ☆11 · Mar 29, 2016 · Updated 10 years ago
- Data models for Segment built using dbt (getdbt.com). ☆12 · Jul 31, 2024 · Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer ☆25 · Dec 31, 2024 · Updated last year
- The Databend plugin for dbt (data build tool) ☆12 · Mar 17, 2023 · Updated 3 years ago
- Segment's bundled integration for Firebase on iOS ☆13 · Mar 26, 2024 · Updated 2 years ago
- ☆20 · Apr 27, 2012 · Updated 14 years ago
- Tools to deploy Hadoop on EMC Isilon ☆17 · Jul 27, 2016 · Updated 9 years ago
- A template for starting a scala project with sbt ☆35 · May 16, 2023 · Updated 2 years ago
- Configures and builds a database for engagement events generated by Amazon Simple Email Service (SES) and Amazon Pinpoint engagements usi… ☆13 · Jan 16, 2025 · Updated last year
- ☆18 · Sep 7, 2014 · Updated 11 years ago
- Apache Storm 0.9.3-rc1 Docker cluster deployed on Apache Mesos with Marathon. ☆11 · Jan 5, 2015 · Updated 11 years ago
- ☆12 · Mar 15, 2022 · Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat… ☆16 · Apr 15, 2023 · Updated 3 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in… ☆19 · Aug 16, 2019 · Updated 6 years ago
- Cloudera Manager CM API Python end-to-end example ☆15 · Aug 29, 2019 · Updated 6 years ago
- Mapless is a small framework for storing objects in a key->data fashion (i.e.: noSQL databases) without requiring any kind of object-data… ☆10 · Feb 14, 2020 · Updated 6 years ago
- An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework ☆13 · Feb 5, 2016 · Updated 10 years ago
- ☆15 · Jan 17, 2022 · Updated 4 years ago
- ☆18 · May 1, 2016 · Updated 10 years ago
- Trident state implementation for Redis ☆29 · Dec 18, 2023 · Updated 2 years ago