jinyeluo / smarthbasecompactor
A smart, automated, non-intrusive driver for HBase region-level major compaction
☆8Updated 8 years ago
Alternatives and similar repositories for smarthbasecompactor
Users who are interested in smarthbasecompactor are comparing it to the repositories listed below
- Tachyon service for Ambari☆9Updated 9 years ago
- Allows wrapping existing WebUI pages and present them as Ambari Views☆9Updated 9 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Updated 7 years ago
- Quickly deploy Hadoop with the help of Ansible and Apache Ambari☆38Updated 9 years ago
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.☆14Updated 9 years ago
- Presto K8S Operator☆9Updated 5 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- Spark cloud integration: tests, cloud committers and more☆19Updated 3 months ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆34Updated 11 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- HDFS Automatic Snapshot Service for Linux☆12Updated 8 years ago
- Dockerfile and artifacts for running a self-contained HDP 2.3 "cluster" in a docker container☆10Updated 8 years ago
- ☆26Updated 5 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆19Updated 7 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- kafka-connect-s3: Ingest data from Kafka to Object Stores (S3)☆95Updated 6 years ago
- Hadoop YARN & MapReduce Memory Calculator☆13Updated 9 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 7 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Updated 8 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50Updated 8 years ago
- ☆26Updated 8 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting.☆172Updated 7 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year