☆26Mar 18, 2016Updated 10 years ago
Alternatives and similar repositories for hadoopUtils
Users that are interested in hadoopUtils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published b…☆23Apr 9, 2014Updated 12 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- Collection of Pig scripts that I use for my talks and workshops☆39Apr 30, 2013Updated 13 years ago
- ☆44Jul 24, 2017Updated 8 years ago
- A shell script to automate the operations of sqoop☆11Mar 29, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆195Jun 21, 2022Updated 3 years ago
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- this is a db-hdfs tools used to transfer big database datas to hadoop hdfs like sqoop,but bboss bigdata tool is very nice monitor and ev…☆27Nov 17, 2025Updated 5 months ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆26Nov 30, 2019Updated 6 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations☆28Oct 15, 2011Updated 14 years ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- A reusable workflow to show how to orchestrate many iterations of an action concurrently, in a single pane of glass. See medium write-up …☆12Nov 8, 2024Updated last year
- Code to support Databases blog post - How to offload data from your transactional NoSQL database to Amazon S3, perform advanced analytics…☆15Mar 26, 2020Updated 6 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Apache Beam example project☆13Oct 16, 2019Updated 6 years ago
- Airflow script for incremental data import from Mysql to Hive using Sqoop.☆18Jun 6, 2018Updated 7 years ago
- This is the collection of some handy tips running Nexus Repository Manager OSS☆14Aug 20, 2016Updated 9 years ago
- ☆21Jun 23, 2019Updated 6 years ago
- Example script to deploy DAGs to Google Cloud Composer.☆15Jun 30, 2022Updated 3 years ago
- Data and example code for Programming Pig, by Alan F. Gates☆186Oct 15, 2016Updated 9 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆152May 1, 2024Updated 2 years ago
- ☆14Aug 10, 2021Updated 4 years ago
- Reference Architectures for Apache Spark☆38Jan 23, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Integrate Grafana with Ambari Metrics System☆27Jun 13, 2025Updated 10 months ago
- Dockerfiles of wocker/wocker for Wocker.☆14Aug 13, 2020Updated 5 years ago
- scala and spark examples project☆14Feb 19, 2018Updated 8 years ago
- This repository is created for TechCommanders and O'Reilly Students who have taken the Google Cloud Professional Security Engineer Crash …☆16Jul 27, 2021Updated 4 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- Tutorial for Cloud Dataflow☆17Mar 12, 2019Updated 7 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Oct 20, 2017Updated 8 years ago
- Reuse Jenkinsfiles across repositories and hydrate commands and settings with config from each repository☆23Mar 9, 2023Updated 3 years ago
- Given a file and a chunk size in megabytes, calculates what the Amazon S3 etag will be.☆16Aug 7, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Play framework template based on SB-Admin-2☆13Mar 13, 2015Updated 11 years ago
- ☆10Jun 16, 2015Updated 10 years ago
- Dockerfile and associated other stuff for building a LAMP stack☆77Nov 8, 2013Updated 12 years ago
- A stack overflow for Apache Spark☆72Apr 26, 2017Updated 9 years ago
- Playframework module to create automatic sitemaps☆19Jan 28, 2019Updated 7 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Dec 13, 2017Updated 8 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago