Examples for Apache Oozie book
☆18May 30, 2016Updated 10 years ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- Maven 构建Spring,Hibernate,Struts2 web项目☆20Jun 9, 2016Updated 10 years ago
- Simple Spark example of generating table stats for use of data quality checks☆27Apr 28, 2017Updated 9 years ago
- Instant search for Sphinx☆14Apr 5, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- In this very simple Docker Swarm Demo we create Docker hosts with Docker Machine and install after this a small Elasticsearch cluster.☆12Jul 31, 2016Updated 9 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Feb 10, 2016Updated 10 years ago
- Node utility for captioning images via imageMagick☆12Aug 13, 2015Updated 10 years ago
- File compaction tool that runs on top of the Spark framework.☆59Apr 17, 2019Updated 7 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆37May 14, 2019Updated 7 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆38May 7, 2026Updated last month
- An automated, opinionated way to deploy to AWS Lambda☆13Mar 22, 2018Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Data models for Segment built using dbt (getdbt.com).☆12Jul 31, 2024Updated last year
- HBase.MCC (HBase Multi Cluster Client). The goal is to support aways up solutions with HBase through multiple clusters☆14Nov 9, 2015Updated 10 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 3 years ago
- Segment's bundled integration for Firebase on iOS☆13May 20, 2026Updated 3 weeks ago
- ☆20Apr 27, 2012Updated 14 years ago
- Open Network Insight Documents - this is a repository for images and collateral. Visit the wiki at https://github.com/Open-Network-Insi…☆10Sep 21, 2016Updated 9 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Jul 27, 2016Updated 9 years ago
- A template for starting a scala project with sbt☆35May 16, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Programming Hive读书笔记☆12May 29, 2014Updated 12 years ago
- ☆18Sep 7, 2014Updated 11 years ago
- Apache Storm 0.9.3-rc1 Docker cluster deployed on Apache Mesos with Marathon.☆11Jan 5, 2015Updated 11 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 3 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Aug 16, 2019Updated 6 years ago
- Cloudera Manager CM API Python end-to-end example☆15Aug 29, 2019Updated 6 years ago
- Mapless is a small framework for storing objects in a key->data fashion (i.e.: noSQL databases) without requiring any kind of object-data…☆10Feb 14, 2020Updated 6 years ago
- Tools for Hadoop☆24Feb 27, 2012Updated 14 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework☆13Feb 5, 2016Updated 10 years ago
- ☆15Jan 17, 2022Updated 4 years ago
- Trident state implementation for Redis☆29Dec 18, 2023Updated 2 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Nov 8, 2018Updated 7 years ago
- PGT allows you to generate pcaps using python without touching the network in any way. It is dependent upon scapy.☆29Jan 3, 2022Updated 4 years ago
- The ElasticSearch View Plugin provides a simple way to render ElasticSearch documents in HTML, XML or text☆49Mar 3, 2013Updated 13 years ago
- Layout & typography for LaTeX books using the memoir document class☆10Aug 22, 2024Updated last year