Profiler for large-scale distributed Java applications (Spark, Scalding, MapReduce, Hive, ...) on YARN.
☆129 · Sep 7, 2018 · Updated 7 years ago
Alternatives and similar repositories for babar
Users who are interested in babar are comparing it to the libraries listed below.
- Java event logs collector for hadoop and frameworks ☆42 · Mar 25, 2025 · Updated last year
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap… ☆306 · Oct 30, 2025 · Updated 6 months ago
- ☆12 · May 16, 2017 · Updated 8 years ago
- Client libraries of end users of Apache Kyuubi ☆11 · Jan 10, 2023 · Updated 3 years ago
- An embedded job scheduler. ☆117 · Jul 29, 2024 · Updated last year
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆16 · Jan 4, 2026 · Updated 4 months ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu… ☆257 · Apr 7, 2023 · Updated 3 years ago
- JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter ☆1,807 · Apr 24, 2026 · Updated last week
- ☆14 · Sep 18, 2016 · Updated 9 years ago
- Ambari service for RedHat FreeIPA ☆11 · Sep 30, 2016 · Updated 9 years ago
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp… ☆821 · Apr 24, 2026 · Updated last week
- A re-implementation of Hadoop DistCP in Apache Spark ☆47 · Dec 20, 2023 · Updated 2 years ago
- Hadoop FSImage Analyzer (HFSA) ☆68 · Apr 25, 2026 · Updated last week
- ☆17 · Aug 8, 2017 · Updated 8 years ago
- How to plot for papers, slides, demos, etc. ☆10 · Apr 7, 2022 · Updated 4 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Sep 8, 2022 · Updated 3 years ago
- ☆18 · Jan 17, 2025 · Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,370 · Aug 22, 2023 · Updated 2 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis ☆17 · Nov 11, 2018 · Updated 7 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 · Jul 7, 2016 · Updated 9 years ago
- ☆17 · Mar 19, 2024 · Updated 2 years ago
- (Archived) End-to-end SQL fuzz testing for DataFusion using SQLancer ☆13 · Apr 16, 2026 · Updated 2 weeks ago
- Hadoop utility jar for troubleshooting integration with cloud object stores ☆37 · Mar 3, 2026 · Updated 2 months ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆589 · Jun 26, 2024 · Updated last year
- A collection of Apache Parquet add-on modules ☆30 · Apr 27, 2026 · Updated last week
- Train TensorFlow models on YARN in just a few lines of code! ☆93 · Nov 3, 2023 · Updated 2 years ago
- ☆102 · Mar 23, 2020 · Updated 6 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆242 · Mar 26, 2015 · Updated 11 years ago
- ☆109 · Jul 5, 2023 · Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆898 · Apr 27, 2026 · Updated last week
- Uses log4j to insert log info into Elasticsearch ☆26 · Oct 31, 2016 · Updated 9 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distributed Filesystem (HDFS) simpler and more intu… ☆37 · Mar 20, 2026 · Updated last month
- Cache File System optimized for columnar formats and object stores ☆188 · Aug 11, 2022 · Updated 3 years ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like… ☆55 · Jan 2, 2023 · Updated 3 years ago
- ☆11 · Feb 24, 2016 · Updated 10 years ago
- Hadoop utility to compact small files ☆18 · Feb 16, 2026 · Updated 2 months ago
- Serverless proxy for Spark cluster ☆325 · Apr 13, 2026 · Updated 3 weeks ago
- Utility for benchmarking changes in Spark using TPC-DS workloads ☆16 · Jun 3, 2021 · Updated 4 years ago
- A tool and library for easily deploying applications on Apache YARN ☆145 · Mar 12, 2024 · Updated 2 years ago