paypal/NNAnalytics

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/paypal/NNAnalytics)

paypal / NNAnalytics

NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.

☆121

Alternatives and similar repositories for NNAnalytics

Users that are interested in NNAnalytics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

paypal / gimel
View on GitHub
Big Data Processing Framework - Unified Data API or SQL on Any Storage
☆252Jul 10, 2025Updated last year
linkedin / dr-elephant
View on GitHub
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370Aug 22, 2023Updated 2 years ago
marcelmay / hfsa
View on GitHub
Hadoop FSImage Analyzer (HFSA)
☆68Jun 24, 2026Updated 3 weeks ago
ExpediaGroup / waggle-dance
View on GitHub
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
☆288Jun 25, 2026Updated 3 weeks ago
ashrithr / ankus
View on GitHub
ANKUS is a deployment & orchestration tool for big data frameworks
☆20Apr 3, 2015Updated 11 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ExpediaGroup / datasqueeze
View on GitHub
Hadoop utility to compact small files
☆18Feb 16, 2026Updated 5 months ago
yaooqinn / spark-authorizer
View on GitHub
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆183Apr 6, 2022Updated 4 years ago
linyiqun / yarn-jobhistory-crawler
View on GitHub
JobHistory上的job信息爬取工具
☆34Nov 11, 2015Updated 10 years ago
Intel-bigdata / SSM
View on GitHub
Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution
☆139Jan 3, 2023Updated 3 years ago
NII-cloud-operation / Literate-computing-Hadoop
View on GitHub
Literate Computing for Reproducible Infrastructure - Hadoop Practice
☆11Mar 5, 2026Updated 4 months ago
sforteln / HdfsBlockFinder
View on GitHub
Allows you to see where(datanodes) that contain a file in HDFS
☆17Mar 16, 2013Updated 13 years ago
hortonworks-spark / spark-atlas-connector
View on GitHub
A Spark Atlas connector to track data lineage in Apache Atlas
☆268Nov 16, 2022Updated 3 years ago
marcelmay / hadoop-hdfs-fsimage-exporter
View on GitHub
Exports Hadoop HDFS content statistics to Prometheus
☆163Jun 24, 2026Updated 3 weeks ago
liancheng / spear
View on GitHub
A playground for experimenting ideas that may apply to Spark SQL/Catalyst
☆143Jul 5, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
openaire / vipe
View on GitHub
Tool for visualizing Apache Oozie pipelines
☆13Feb 15, 2016Updated 10 years ago
prestodb / presto-yarn
View on GitHub
☆58Mar 27, 2019Updated 7 years ago
ExpediaGroup / shunting-yard
View on GitHub
Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.
☆20Oct 11, 2021Updated 4 years ago
qubole / sparklens
View on GitHub
Qubole Sparklens tool for performance tuning Apache Spark
☆592Jun 26, 2024Updated 2 years ago
cerndb / hdfs-metadata
View on GitHub
Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…
☆55May 9, 2017Updated 9 years ago
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
yaooqinn / spark-history-cli
View on GitHub
CLI tool for querying Apache Spark History Server REST API
☆28Mar 22, 2026Updated 3 months ago
aw-was-here / eco-release-metadata
View on GitHub
Apache Project Changes and Release Notes as generated by Apache Yetus
☆10Nov 14, 2020Updated 5 years ago
hortonworks / hive-testbench
View on GitHub
☆392Jan 25, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
prestodb / presto-hadoop-apache
View on GitHub
Shaded version of Apache Hadoop 2.x for Presto
☆16Sep 16, 2025Updated 10 months ago
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
bytedance / nnproxy
View on GitHub
Scalable NameNode RPC Proxy for HDFS Federation
☆89Apr 19, 2016Updated 10 years ago
amplab / drizzle-spark
View on GitHub
Drizzle integration with Apache Spark
☆120Sep 11, 2018Updated 7 years ago
mhausenblas / mc
View on GitHub
A Simple Mesos-DNS Client
☆10Jun 20, 2015Updated 11 years ago
apache / griffin
View on GitHub
Mirror of Apache griffin
☆1,169Aug 3, 2025Updated 11 months ago
king / bravo
View on GitHub
Utilities for processing Flink checkpoints/savepoints
☆75Dec 11, 2019Updated 6 years ago
ckljohn / airflow-on-EKS
View on GitHub
☆12Sep 2, 2021Updated 4 years ago
apache / yunikorn-release
View on GitHub
Apache YuniKorn Release
☆45Updated this week
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
gateway-experiments / hadoop-yarn-api-python-client
View on GitHub
Python client for Hadoop® YARN API
☆109Sep 26, 2022Updated 3 years ago
xavient / CDS
View on GitHub
Content Data Store (HDFS/HBase)
☆13Dec 1, 2016Updated 9 years ago
dharmeshkakadia / presto-kubernetes
View on GitHub
Running Presto on k8s
☆38Aug 26, 2019Updated 6 years ago
skaiworldwide-oss / postgres-xl-ha
View on GitHub
☆11Jun 26, 2017Updated 9 years ago
hammerlab / spark-json-relay
View on GitHub
SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.
☆16Apr 6, 2021Updated 5 years ago
yu-iskw / spark-ranking-algorithms
View on GitHub
Ranking algorithms for Spark machine learning pipeline
☆14Jan 6, 2018Updated 8 years ago
openankus / ankus
View on GitHub
Numeric / Norminal Statistics, Certainty Factor, Normalize, ETL, TF-IDF, Discretization on Hadoop MapReduce
☆11Jun 28, 2016Updated 10 years ago