☆26Mar 18, 2016Updated 10 years ago
Alternatives and similar repositories for hadoopUtils
Users that are interested in hadoopUtils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published b…☆23Apr 9, 2014Updated 12 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- Collection of Pig scripts that I use for my talks and workshops☆39Apr 30, 2013Updated 13 years ago
- MySQL to NoSQL real time dataflow☆19Oct 14, 2017Updated 8 years ago
- ☆195Jun 21, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆26Nov 30, 2019Updated 6 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations☆28Oct 15, 2011Updated 14 years ago
- ☆23Nov 17, 2022Updated 3 years ago
- SmartTune is a black-box optimization that can automatically find good performance settings for a complex system's configuration knobs.☆11Nov 23, 2022Updated 3 years ago
- Code to support Databases blog post - How to offload data from your transactional NoSQL database to Amazon S3, perform advanced analytics…☆15Mar 26, 2020Updated 6 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- Apache Beam example project☆13Oct 16, 2019Updated 6 years ago
- This repository has the code from the text and the videos for "Introduction to Programming and Problem Solving using Scala".☆30Feb 11, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Mar 14, 2021Updated 5 years ago
- Data and example code for Programming Pig, by Alan F. Gates☆186Oct 15, 2016Updated 9 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆152May 1, 2024Updated 2 years ago
- ☆14Aug 10, 2021Updated 4 years ago
- ☆11Aug 1, 2022Updated 3 years ago
- Integrate Grafana with Ambari Metrics System☆27Jun 13, 2025Updated 11 months ago
- ☆10Jan 19, 2016Updated 10 years ago
- unopinionated framework for React based admin applications☆10May 4, 2021Updated 5 years ago
- Minimal app for demonstrating use of flask-security☆18Jul 6, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for my videos on big data analytics with Apache Spark using Scala.☆62Feb 11, 2018Updated 8 years ago
- scala and spark examples project☆14Feb 19, 2018Updated 8 years ago
- Generate a Redshift .manifest file for a given S3 bucket☆21Nov 16, 2017Updated 8 years ago
- This repository is created for TechCommanders and O'Reilly Students who have taken the Google Cloud Professional Security Engineer Crash …☆16Jul 27, 2021Updated 4 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- Tutorial for Cloud Dataflow☆17Mar 12, 2019Updated 7 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Oct 20, 2017Updated 8 years ago
- Reuse Jenkinsfiles across repositories and hydrate commands and settings with config from each repository☆23Mar 9, 2023Updated 3 years ago
- Given a file and a chunk size in megabytes, calculates what the Amazon S3 etag will be.☆16Aug 7, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- My vim configuration☆14Jul 8, 2022Updated 3 years ago
- Play framework template based on SB-Admin-2☆13Mar 13, 2015Updated 11 years ago
- Amazon AWS login with Google credentials☆13Feb 11, 2019Updated 7 years ago
- Dockerfile and associated other stuff for building a LAMP stack☆77Nov 8, 2013Updated 12 years ago
- ☆22Apr 21, 2014Updated 12 years ago
- A stack overflow for Apache Spark☆72Apr 26, 2017Updated 9 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Dec 13, 2017Updated 8 years ago