☆26Mar 18, 2016Updated 9 years ago
Alternatives and similar repositories for hadoopUtils
Users who are interested in hadoopUtils are comparing it to the libraries listed below.
- Airflow script for incremental data import from MySQL to Hive using Sqoop.☆18Jun 6, 2018Updated 7 years ago
- This repository has the code from the text and the videos for "Introduction to Programming and Problem Solving using Scala".☆30Feb 11, 2018Updated 8 years ago
- A few scripts to automate daily data loads from an RDBMS to a partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- ☆10May 16, 2022Updated 3 years ago
- A clean online résumé (CV)☆13Jun 6, 2024Updated last year
- A shell script to automate Sqoop operations☆11Mar 29, 2021Updated 4 years ago
- HackerRank Programming Challenges☆10May 8, 2021Updated 4 years ago
- This is a list of YAML file examples for Docker, Kubernetes, Ansible. Also includes a Python script.☆10Jan 12, 2021Updated 5 years ago
- GPO Bypass is a tool / proof-of-concept that highlights how one can bypass Group Policy enforced policies. It uses Firefox as an example.☆14Jan 28, 2023Updated 3 years ago
- Compares various time-series feature sets on computational performance, within-set structure, and between-set relationships.☆11Jun 3, 2022Updated 3 years ago
- ☆194Jun 21, 2022Updated 3 years ago
- On-demand port forwarding to k8s.☆23Feb 7, 2026Updated last month
- Deep Learning (PyTorch) Models Deployment using SQL databases☆10Jul 25, 2021Updated 4 years ago
- Spark implementation of Slowly Changing Dimension type 2☆11Jan 8, 2019Updated 7 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- Multi-modality Hierarchical Recall based on GBDTs for Bipolar Disorder Classification☆10Jul 12, 2023Updated 2 years ago
- ☆10Dec 5, 2022Updated 3 years ago
- Two-day level 300 Azure Synapse Analytics workshop☆11Mar 16, 2021Updated 4 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 5 years ago
- Reply to hot /r/NFTsMarketplace airdrop posts with your wallet address☆11Aug 27, 2023Updated 2 years ago
- 🌎👨💻 The source files and content for my personal site.☆11May 1, 2023Updated 2 years ago
- Neue Scraper☆10Feb 1, 2026Updated last month
- Ansible with Kubernetes☆10Feb 14, 2023Updated 3 years ago
- Demo of an In-database processing tool for scikit-learn☆13Oct 18, 2022Updated 3 years ago
- Java OutOfMemory Example☆11Jun 19, 2021Updated 4 years ago
- Auto Generate Airflow's dag.py On The Fly☆10Feb 10, 2025Updated last year
- Sample demo to deploy an Apache Kafka cluster and monitor it using Strimzi, Grafana and Prometheus operators.☆10May 18, 2021Updated 4 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Dec 7, 2022Updated 3 years ago
- ☆10Feb 24, 2021Updated 5 years ago
- This is a simple Python daemon to monitor your Impala nodes.☆10Apr 13, 2021Updated 4 years ago
- ☆13Dec 5, 2022Updated 3 years ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- Portfolio Site☆18Dec 28, 2025Updated 2 months ago
- Reusable Python classes that extend open source PySpark capabilities. Examples of implementation are available under notebooks of repo htt…☆13Nov 1, 2024Updated last year
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash☆15Mar 8, 2018Updated 8 years ago
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 5 years ago
- Docs, code, and resources to prepare for the CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 with Python 3 certific…☆10Sep 25, 2019Updated 6 years ago
- Metabase Teradata Driver shipped as 3rd party plugin☆11Dec 1, 2025Updated 3 months ago