Hadoop FSImage Analyzer (HFSA)
☆66Feb 24, 2026Updated this week
Alternatives and similar repositories for hfsa
Users who are interested in hfsa compare it to the libraries listed below
- Exports Hadoop HDFS content statistics to Prometheus☆163Updated this week
- On-the-fly translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 10 months ago
- Scalable NameNode RPC Proxy for HDFS Federation☆87Apr 19, 2016Updated 9 years ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- ☆15Oct 12, 2021Updated 4 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆72Jan 1, 2023Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 4 months ago
- ☆393Jan 25, 2024Updated 2 years ago
- NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.☆120Nov 25, 2025Updated 3 months ago
- Java event log collector for Hadoop and related frameworks☆41Mar 25, 2025Updated 11 months ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Jul 9, 2025Updated 7 months ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated 2 weeks ago
- A Spark datasource for the HadoopOffice library☆36Sep 29, 2025Updated 5 months ago
- Presto for Cloudera Manager☆15Apr 22, 2016Updated 9 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Sep 17, 2025Updated 5 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆284Updated this week
- Plugin to create fake visits, websites, users and goals to populate Matomo reports☆22Feb 4, 2026Updated 3 weeks ago
- Vagrant / Ansible environment to deploy a local TDP cluster☆20Jan 9, 2026Updated last month
- ☆17Mar 19, 2024Updated last year
- Visualize your HDFS cluster usage☆228Oct 13, 2020Updated 5 years ago
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.☆128Sep 7, 2018Updated 7 years ago
- ByteBuffer utilities using Unsafe for fast reads.☆22Apr 4, 2014Updated 11 years ago
- This plugin lets you document your physical servers across multiple datacenters☆21Nov 17, 2020Updated 5 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆283Jul 28, 2023Updated 2 years ago
- Solr exporter for Prometheus.☆28May 20, 2019Updated 6 years ago
- Already merged into apache/incubator-kyuubi: ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Apr 23, 2019Updated 6 years ago
- ☆72Feb 21, 2026Updated last week
- Hadoop exporter☆53Jan 27, 2020Updated 6 years ago
- Storage Benchmark Kit☆33Nov 5, 2025Updated 3 months ago
- A Hadoop-compatible FUSE for all to use.☆29Sep 25, 2024Updated last year
- CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source bi…☆489Oct 31, 2025Updated 4 months ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 2 years ago
- Advanced block device testing/file system testing, targeting SNIA compatible reporting☆12Oct 15, 2025Updated 4 months ago
- Choregraphie offers primitives to coordinate convergence of Chef resources.☆30Jan 15, 2025Updated last year
- StarRocks slow query monitoring☆46Dec 22, 2025Updated 2 months ago
- ☆74Oct 7, 2013Updated 12 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,371Aug 22, 2023Updated 2 years ago
- Lustre Repository with MS patches☆13Updated this week