☆26Mar 18, 2016Updated 10 years ago
Alternatives and similar repositories for hadoopUtils
Users that are interested in hadoopUtils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published b…☆23Apr 9, 2014Updated 11 years ago
- Jupyter Notebooks for Data Science☆12Jan 12, 2017Updated 9 years ago
- The http://analyticsdojo.com open source codebase and curriculum. Learn to data science today.☆38Dec 13, 2016Updated 9 years ago
- ☆195Jun 21, 2022Updated 3 years ago
- ☆59Oct 17, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations☆28Oct 15, 2011Updated 14 years ago
- Export JSON, CSV, XML, XLSX from a Oracle DB using the command line☆14Dec 16, 2016Updated 9 years ago
- ☆23Nov 17, 2022Updated 3 years ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Find people to play pickup sports with☆21Mar 6, 2016Updated 10 years ago
- SmartTune is a black-box optimization that can automatically find good performance settings for a complex system's configuration knobs.☆11Nov 23, 2022Updated 3 years ago
- A reusable workflow to show how to orchestrate many iterations of an action concurrently, in a single pane of glass. See medium write-up …☆12Nov 8, 2024Updated last year
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- Apache Beam example project☆13Oct 16, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs☆13Feb 13, 2017Updated 9 years ago
- Airflow script for incremental data import from Mysql to Hive using Sqoop.☆18Jun 6, 2018Updated 7 years ago
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Mar 14, 2021Updated 5 years ago
- This is the collection of some handy tips running Nexus Repository Manager OSS☆14Aug 20, 2016Updated 9 years ago
- ☆20Jun 23, 2019Updated 6 years ago
- Data and example code for Programming Pig, by Alan F. Gates☆186Oct 15, 2016Updated 9 years ago
- A simple pre-made personal website with blogging and social integrations☆38Nov 12, 2023Updated 2 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆152May 1, 2024Updated last year
- ☆14Aug 10, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Integrate Grafana with Ambari Metrics System☆27Jun 13, 2025Updated 9 months ago
- ☆10Jan 19, 2016Updated 10 years ago
- Minimal app for demonstrating use of flask-security☆18Jul 6, 2018Updated 7 years ago
- Unsupported - Event-driven cross-site app promotion utility using the notification endpoint of the QRS API and Python.☆14Feb 1, 2021Updated 5 years ago
- ☆12Jan 30, 2024Updated 2 years ago
- Various scripts related to project☆19Feb 23, 2026Updated last month
- scala and spark examples project☆14Feb 19, 2018Updated 8 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- nconf wrapper that simplifies work with environment specific configuration files☆15Aug 28, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Tutorial for Cloud Dataflow☆17Mar 12, 2019Updated 7 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Oct 20, 2017Updated 8 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Given a file and a chunk size in megabytes, calculates what the Amazon S3 etag will be.☆16Aug 7, 2020Updated 5 years ago
- Play framework template based on SB-Admin-2☆13Mar 13, 2015Updated 11 years ago
- Resources for Code Cafe Online 4th May 2020☆11May 5, 2020Updated 5 years ago
- A stack overflow for Apache Spark☆72Apr 26, 2017Updated 8 years ago