Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. The enterprise feature offering is available at http://jumbune.com. More details of the open source offering are at,
☆72 · Jan 1, 2023 · Updated 3 years ago
Alternatives and similar repositories for jumbune
Users that are interested in jumbune are comparing it to the libraries listed below
- Data Quality Monitoring Tool ☆15 · Dec 5, 2017 · Updated 8 years ago
- High performance HBase / Spark SQL engine ☆28 · Jul 7, 2022 · Updated 3 years ago
- Materials for various Hadoop & Nifi related workshops ☆52 · Mar 20, 2019 · Updated 7 years ago
- Avro, Protobuf, Thrift on Swagger ☆19 · Jul 10, 2017 · Updated 8 years ago
- Manufacturing specifications ☆25 · Jun 6, 2022 · Updated 3 years ago
- A collection of pentest tools and resources targeting Hadoop environments ☆35 · Mar 2, 2017 · Updated 9 years ago
- Remedy small files by combining them into larger ones. ☆195 · Jul 1, 2022 · Updated 3 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide) ☆17 · Sep 26, 2017 · Updated 8 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w… ☆16 · Oct 3, 2025 · Updated 5 months ago
- Kafka Source and Sink Connectors ☆19 · Jan 18, 2018 · Updated 8 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase ☆14 · Mar 23, 2016 · Updated 9 years ago
- A more pretty, more usable web dashboard for Apache Oozie, written in Scala. ☆72 · May 6, 2013 · Updated 12 years ago
- Ansible scripts for deploying Kafka on EC2 ☆10 · Oct 7, 2016 · Updated 9 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format ☆127 · Jan 14, 2022 · Updated 4 years ago
- DataQuality for BigData ☆148 · Dec 15, 2023 · Updated 2 years ago
- Expletives vomiting library... ☆13 · Apr 17, 2017 · Updated 8 years ago
- Visualize your HDFS cluster usage ☆228 · Oct 13, 2020 · Updated 5 years ago
- YAML-based database of datacenter infrastructures ☆25 · Dec 22, 2025 · Updated 3 months ago
- Spark code to analyze HBase Snapshots ☆35 · Feb 19, 2018 · Updated 8 years ago
- Hadoop FSImage Analyzer (HFSA) ☆67 · Updated this week
- Generic spark module for scanning, joining and mutating HBase tables to and from RDDs. ☆15 · Aug 14, 2015 · Updated 10 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distributed Filesystem (HDFS) simpler and more intu… ☆36 · Mar 9, 2026 · Updated last week
- Java helper class to call Oracle SQL*Loader (SqlLdr) tool with a nice high level interface, to perform bulk load easily from Java. ☆11 · Mar 8, 2016 · Updated 10 years ago
- Lenses.io JDBC driver for Apache Kafka ☆22 · May 7, 2021 · Updated 4 years ago
- Set up tools for running a few DL libraries on CDH and CDSW ☆17 · Jul 23, 2020 · Updated 5 years ago
- ☆12 · Mar 15, 2022 · Updated 4 years ago
- This plugin lets you document your physical servers across multiple datacenters ☆21 · Nov 17, 2020 · Updated 5 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an… ☆55 · May 9, 2017 · Updated 8 years ago
- ☆15 · Jan 17, 2022 · Updated 4 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels ☆10 · Oct 11, 2020 · Updated 5 years ago
- Spark stream from kafka(json) to s3(parquet) ☆15 · Nov 8, 2018 · Updated 7 years ago
- Simple way to copy data from relational databases into kafka. ☆20 · Oct 1, 2017 · Updated 8 years ago
- Algorithms and Data Structures implemented in Java ☆12 · Jul 28, 2019 · Updated 6 years ago
- Demo Digital Signage System intended to be used with Visionect's E-Paper platform for quick-start. Based in Go and Javascript, built cros… ☆16 · Apr 28, 2016 · Updated 9 years ago
- Scripts for building Cloudera Manager parcel and CSD for Livy Spark Server ☆21 · Oct 18, 2017 · Updated 8 years ago
- ☆11 · Dec 14, 2016 · Updated 9 years ago
- Hadoop Cluster Configurations ☆32 · Aug 5, 2021 · Updated 4 years ago
- Groovy client library for Apache Ambari's REST API ☆20 · Jun 25, 2021 · Updated 4 years ago
- high performance statsd drop-in ☆21 · Mar 29, 2016 · Updated 9 years ago