Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
☆129Sep 7, 2018Updated 7 years ago
Alternatives and similar repositories for babar
Users that are interested in babar are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Java event logs collector for hadoop and frameworks☆41Mar 25, 2025Updated last year
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆306Oct 30, 2025Updated 4 months ago
- ☆12May 16, 2017Updated 8 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- An embedded job scheduler.☆117Jul 29, 2024Updated last year
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16Jan 4, 2026Updated 2 months ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 2 years ago
- JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter☆1,804Mar 1, 2026Updated 3 weeks ago
- ☆14Sep 18, 2016Updated 9 years ago
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…☆818Mar 4, 2026Updated 3 weeks ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Hadoop FSImage Analyzer (HFSA)☆67Mar 17, 2026Updated last week
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- ☆18Jan 17, 2025Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,369Aug 22, 2023Updated 2 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Jul 7, 2016Updated 9 years ago
- ☆17Mar 19, 2024Updated 2 years ago
- End-to-end SQL fuzz testing for DataFusion using SQLancer☆12Feb 9, 2026Updated last month
- Hadoop utility jar for troubleshooting integration with cloud object stores☆37Mar 3, 2026Updated 3 weeks ago
- Create hadoop cluster in aws ec2 for development☆11Sep 8, 2017Updated 8 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- A collection of Apache Parquet add-on modules☆30Updated this week
- Train TensorFlow models on YARN in just a few lines of code!☆93Nov 3, 2023Updated 2 years ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Aug 29, 2022Updated 3 years ago
- ☆103Mar 23, 2020Updated 6 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆242Mar 26, 2015Updated 10 years ago
- ☆108Jul 5, 2023Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆893Mar 10, 2026Updated 2 weeks ago
- Using log4j insert log info into ElasticSearch☆26Oct 31, 2016Updated 9 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆36Mar 9, 2026Updated 2 weeks ago
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆56Jan 2, 2023Updated 3 years ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated last month
- Serverless proxy for Spark cluster☆325Oct 29, 2020Updated 5 years ago
- Utility for benchmarking changes in Spark using TPC-DS workloads☆16Jun 3, 2021Updated 4 years ago
- Transactions for Stateful Functions as a Service. This repository implements and API and associated underpinnings for two-phase Commit an…☆25Dec 15, 2022Updated 3 years ago
- cephfs-hadoop☆58Dec 10, 2020Updated 5 years ago