Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
☆73Jan 1, 2023Updated 3 years ago
Alternatives and similar repositories for jumbune
Users that are interested in jumbune are comparing it to the libraries listed below.
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- OPC UA is a popular, open source protocol for industrial control systems. Apache Nifi is a data movement toolkit that provides data in…☆21Mar 1, 2018Updated 8 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Mar 2, 2023Updated 3 years ago
- Avro, Protobuf, Thrift on Swagger☆19Jul 10, 2017Updated 8 years ago
- Manufacturing specifications☆25Jun 6, 2022Updated 4 years ago
- Remedy small files by combining them into larger ones.☆195Jul 1, 2022Updated 4 years ago
- Reusable code for Hive☆16Aug 19, 2014Updated 11 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Contains interview question solutions☆12May 18, 2018Updated 8 years ago
- DataQuality for BigData☆149Dec 15, 2023Updated 2 years ago
- Visualize your HDFS cluster usage☆228Oct 13, 2020Updated 5 years ago
- ☆20Apr 27, 2012Updated 14 years ago
- Generic spark module for scanning, joining and mutating HBase tables to and from RDDs.☆15Aug 14, 2015Updated 10 years ago
- Hadoop FSImage Analyzer (HFSA)☆68Jun 24, 2026Updated last week
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distributed Filesystem (HDFS) simpler and more intu…☆39May 7, 2026Updated last month
- Lenses.io JDBC driver for Apache Kafka☆22May 7, 2021Updated 5 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 3 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Aug 16, 2019Updated 6 years ago
- Mastering Apache Spark 2x, published by Packt☆17Jan 30, 2023Updated 3 years ago
- ☆15Jan 17, 2022Updated 4 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Nov 8, 2018Updated 7 years ago
- Deploy Dask on Marathon☆10Feb 6, 2017Updated 9 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆27Dec 13, 2017Updated 8 years ago
- ☆11Dec 14, 2016Updated 9 years ago
- Hadoop Cluster Configurations☆32Aug 5, 2021Updated 4 years ago
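- A Spark self-study handbook covering Spark Core, Spark SQL, Spark Streaming, spark-kafka, and delta-lake, along with basic Scala exercises and source-code analyses (such as master and shuffle), summaries, and translations.☆18Jul 19, 2023Updated 2 years ago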
- Groovy client library for Apache Ambari's REST API☆20Jun 25, 2021Updated 5 years ago
- support oauth client with nginx☆32Jan 6, 2011Updated 15 years ago
- A CLI tool that creates AWS spot instances on the fly☆18Dec 6, 2017Updated 8 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆93Mar 5, 2024Updated 2 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Oct 11, 2021Updated 4 years ago
- A simple kubernetes cron☆12Jun 28, 2016Updated 10 years ago
- Kafka Connect connector for receiving data and writing data to Splunk.☆25Nov 7, 2017Updated 8 years ago
- Spark package for checking data quality☆220Feb 28, 2020Updated 6 years ago
- SequenceIQ Hadoop examples☆114Oct 26, 2015Updated 10 years ago
- AvroSchemaRegistryConvertToESMapping☆25Nov 29, 2021Updated 4 years ago
- File Watcher core library: a lightweight Java library☆30Sep 20, 2018Updated 7 years ago