Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
☆128Sep 7, 2018Updated 7 years ago
Alternatives and similar repositories for babar
Users that are interested in babar are comparing it to the libraries listed below
Sorting:
- Java event logs collector for hadoop and frameworks☆41Mar 25, 2025Updated 11 months ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆304Oct 30, 2025Updated 4 months ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- An embedded job scheduler.☆117Jul 29, 2024Updated last year
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆17Jan 4, 2026Updated 2 months ago
- ☆14Sep 18, 2016Updated 9 years ago
- ☆12May 16, 2017Updated 8 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 2 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆816Updated this week
- JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter☆1,806Jul 12, 2025Updated 7 months ago
- ☆18Jan 17, 2025Updated last year
- Hadoop utility to compact small files☆18Feb 16, 2026Updated 2 weeks ago
- Utility for benchmarking changes in Spark using TPC-DS workloads☆16Jun 3, 2021Updated 4 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 4 months ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,371Aug 22, 2023Updated 2 years ago
- Serverless proxy for Spark cluster☆324Oct 29, 2020Updated 5 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆72Jan 1, 2023Updated 3 years ago
- ☆17Mar 19, 2024Updated last year
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆242Mar 26, 2015Updated 10 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆36Feb 2, 2026Updated last month
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- Hadoop FSImage Analyzer (HFSA)☆66Feb 24, 2026Updated last week
- A collection of Apache Parquet add-on modules☆30Updated this week
- Hadoop utility jar for troubleshooting integration with cloud object stores☆37Feb 20, 2026Updated last week
- Mirror of Apache crail (Incubating)☆151Jul 3, 2022Updated 3 years ago
- My branch of Apache Flume with a generic JDBC sink (not yet licensed to Apache)☆11Feb 12, 2022Updated 4 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆46Feb 4, 2026Updated last month
- An extensible Scala framework for creating monitoring dashboards.☆22Jan 12, 2023Updated 3 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Jul 7, 2016Updated 9 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆280Aug 3, 2018Updated 7 years ago
- A load balancer / proxy / gateway for prestodb☆358Jul 25, 2024Updated last year
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆889Feb 9, 2026Updated 3 weeks ago
- ☆13Mar 3, 2025Updated last year
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago