sourygnahtw/hadoopUtils

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sourygnahtw/hadoopUtils)

sourygnahtw / hadoopUtils

☆26

Alternatives and similar repositories for hadoopUtils

Users that are interested in hadoopUtils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AnalyticsDojo / materials
View on GitHub
Jupyter Notebooks for Data Science
☆12Jan 12, 2017Updated 9 years ago
gwenshap / sqoop2hive
View on GitHub
Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table
☆30Sep 25, 2014Updated 11 years ago
sudar / pig-samples
View on GitHub
Collection of Pig scripts that I use for my talks and workshops
☆39Apr 30, 2013Updated 13 years ago
NikhilDhiman / SQOOP-Automation
View on GitHub
A shell script to automate the operations of sqoop
☆11Mar 29, 2021Updated 5 years ago
AnalyticsDojo / AnalyticsDojo
View on GitHub
The http://analyticsdojo.com open source codebase and curriculum. Learn to data science today.
☆38Dec 13, 2016Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
oreillymedia / programming_hive
View on GitHub
☆44Jul 24, 2017Updated 9 years ago
dgadiraju / code
View on GitHub
☆194Jun 21, 2022Updated 4 years ago
xmlking / cdc-kafka-hadoop
View on GitHub
MySQL to NoSQL real time dataflow
☆19Oct 14, 2017Updated 8 years ago
akashmehta10 / cdc_pyspark_hive
View on GitHub
☆23Nov 17, 2022Updated 3 years ago
Apress / pyspark-recipes
View on GitHub
Source code for 'PySpark Recipes' by Raju Kumar Mishra
☆26Nov 30, 2019Updated 6 years ago
dbist / oozie-examples
View on GitHub
sample oozie workflows
☆17Jun 13, 2017Updated 9 years ago
KyMidd / github-reusable-actions-terraform-concurrency
View on GitHub
A reusable workflow to show how to orchestrate many iterations of an action concurrently, in a single pane of glass. See medium write-up …
☆13Nov 8, 2024Updated last year
smart-inner / smarttune
View on GitHub
SmartTune is a black-box optimization that can automatically find good performance settings for a complex system's configuration knobs.
☆11Nov 23, 2022Updated 3 years ago
RajeshHegde / apache-beam-example
View on GitHub
Apache Beam example project
☆13Oct 16, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MarkCLewis / ProblemSolvingUsingScala
View on GitHub
This repository has the code from the text and the videos for "Introduction to Programming and Problem Solving using Scala".
☆30Feb 11, 2018Updated 8 years ago
eastcirclek / swimlane-graphs
View on GitHub
Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs
☆13Feb 13, 2017Updated 9 years ago
aws-samples / glue-enrich-cost-and-usage
View on GitHub
Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…
☆16Mar 14, 2021Updated 5 years ago
tharsha18 / gluelabs
View on GitHub
☆14Aug 10, 2021Updated 4 years ago
inistar / cicd-cloud-composer
View on GitHub
Example script to deploy DAGs to Google Cloud Composer.
☆15Jun 30, 2022Updated 4 years ago
miguno / avro-cli-examples
View on GitHub
Examples on how to use the command line tools in Avro Tools to read and write Avro files
☆152May 1, 2024Updated 2 years ago
thecloudtechguy / GCP-Security-Engineer-Crash-Course
View on GitHub
This repository is created for TechCommanders and O'Reilly Students who have taken the Google Cloud Professional Security Engineer Crash …
☆16Jul 27, 2021Updated 5 years ago
samberic / wizchat
View on GitHub
☆10Jan 19, 2016Updated 10 years ago
eapowertools-archive / qs-event-driven-cross-site-app-promoter
View on GitHub
Unsupported - Event-driven cross-site app promotion utility using the notification endpoint of the QRS API and Python.
☆14Feb 1, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
oddy / bangingminimal
View on GitHub
Minimal app for demonstrating use of flask-security
☆18Jul 6, 2018Updated 8 years ago
blueskydigital / react-mobx-admin
View on GitHub
unopinionated framework for React based admin applications
☆10May 4, 2021Updated 5 years ago
MarkCLewis / BigDataAnalyticswithSpark
View on GitHub
Code for my videos on big data analytics with Apache Spark using Scala.
☆62Feb 11, 2018Updated 8 years ago
josephmachado / spark_submit_airflow
View on GitHub
Simple repo to demonstrate how to submit a spark job to EMR from Airflow
☆34Oct 18, 2020Updated 5 years ago
jlopezmalla / Flights
View on GitHub
scala and spark examples project
☆14Feb 19, 2018Updated 8 years ago
rdempsey / pyspark-for-data-processing
View on GitHub
Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20Oct 20, 2017Updated 8 years ago
jenkinsci / config-driven-pipeline-plugin
View on GitHub
Reuse Jenkinsfiles across repositories and hydrate commands and settings with config from each repository
☆23Mar 9, 2023Updated 3 years ago
tlastowka / calculate_multipart_etag
View on GitHub
Given a file and a chunk size in megabytes, calculates what the Amazon S3 etag will be.
☆16Aug 7, 2020Updated 5 years ago
AMIS-Services / code-cafe-20200504
View on GitHub
Resources for Code Cafe Online 4th May 2020
☆11May 5, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mamoonraja / analyze-call-center-calls-with-watson
View on GitHub
Hands-on Python + IBM Watson lab being presented at IBM Think2018 conference in March 2018
☆18Apr 25, 2018Updated 8 years ago
malo-denielou / DataflowSME
View on GitHub
Tutorial for Cloud Dataflow
☆17Mar 12, 2019Updated 7 years ago
iateadonut / laravel51-email-authentication
View on GitHub
☆10Jun 16, 2015Updated 11 years ago
marceloemanoel / play-sb-admin2
View on GitHub
Play framework template based on SB-Admin-2
☆13Mar 13, 2015Updated 11 years ago
absognety / atomic-scala
View on GitHub
Atomic Scala Book Solutions - for Beginners and first time Functional Programmers
☆12Mar 10, 2020Updated 6 years ago
AllenFang / spark-overflow
View on GitHub
A stack overflow for Apache Spark
☆72Apr 26, 2017Updated 9 years ago
spider-123-eng / Spark
View on GitHub
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficient…
☆54Nov 16, 2022Updated 3 years ago