Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
☆129Sep 7, 2018Updated 7 years ago
Alternatives and similar repositories for babar
Users that are interested in babar are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Java event logs collector for hadoop and frameworks☆42Mar 25, 2025Updated last year
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆306Oct 30, 2025Updated 8 months ago
- Client libraries of end users of Apache Kyuubi☆11May 15, 2026Updated last month
- An embedded job scheduler.☆116Jul 29, 2024Updated last year
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 22, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆256Apr 7, 2023Updated 3 years ago
- JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter☆1,802May 21, 2026Updated last month
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…☆826May 19, 2026Updated last month
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Hadoop FSImage Analyzer (HFSA)☆68Jun 24, 2026Updated last week
- ☆17Aug 8, 2017Updated 8 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- ☆18Jan 17, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,368Aug 22, 2023Updated 2 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆16Nov 11, 2018Updated 7 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Jul 7, 2016Updated 9 years ago
- ☆17Mar 19, 2024Updated 2 years ago
- (Archived) End-to-end SQL fuzz testing for DataFusion using SQLancer☆13Apr 16, 2026Updated 2 months ago
- Hadoop utility jar for troubleshooting integration with cloud object stores☆38Updated this week
- Qubole Sparklens tool for performance tuning Apache Spark☆591Jun 26, 2024Updated 2 years ago
- A collection of Apache Parquet add-on modules☆30Jun 14, 2026Updated 2 weeks ago
- Train TensorFlow models on YARN in just a few lines of code!☆93Nov 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Aug 29, 2022Updated 3 years ago
- ☆102Mar 23, 2020Updated 6 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆240Mar 26, 2015Updated 11 years ago
- ☆109Jul 5, 2023Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆905Jun 9, 2026Updated 3 weeks ago
- Using log4j insert log info into ElasticSearch☆26Oct 31, 2016Updated 9 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆39May 7, 2026Updated last month
- Cache File System optimized for columnar formats and object stores☆188Aug 11, 2022Updated 3 years ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆55Jan 2, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Feb 24, 2016Updated 10 years ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated 4 months ago
- Serverless proxy for Spark cluster☆324Apr 13, 2026Updated 2 months ago
- A tool and library for easily deploying applications on Apache YARN☆146Mar 12, 2024Updated 2 years ago
- Utility for benchmarking changes in Spark using TPC-DS workloads☆16Jun 3, 2021Updated 5 years ago
- Transactions for Stateful Functions as a Service. This repository implements and API and associated underpinnings for two-phase Commit an…☆25Dec 15, 2022Updated 3 years ago
- cephfs-hadoop☆57Dec 10, 2020Updated 5 years ago