killerwhile / volume-balancer
DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312)
☆23Updated 8 years ago
Alternatives and similar repositories for volume-balancer
Users who are interested in volume-balancer are comparing it to the libraries listed below.
- Low level integration of Spark and Kafka☆130Updated 7 years ago
- ☆243Updated 7 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆216Updated 9 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Apache Spark and Apache Kafka integration example☆124Updated 8 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…☆345Updated 8 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Updated 10 years ago
- Mirror of Apache Slider☆77Updated 7 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 4 years ago
- ☆92Updated 8 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Updated 8 years ago
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting.☆172Updated 8 years ago
- Tools for Spark which we use on a daily basis☆65Updated 5 years ago
- Framework for Apache Flink unit tests☆210Updated 6 years ago
- Apache Spark applications☆70Updated 8 years ago
- Mirror of Apache Spark☆56Updated 10 years ago
- Write your Spark data to Kafka seamlessly☆174Updated last year
- Easy metrics collection for Storm topologies using Coda Hale Metrics☆101Updated 12 years ago
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.☆154Updated 3 years ago
- ☆56Updated 11 years ago
- Spark RDD to read, write and delete from HBase☆274Updated 4 years ago
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- An Apache access log parser written in Scala☆73Updated 4 years ago
- Read SparkSQL parquet file as RDD[Protobuf]☆93Updated 7 years ago
- Trifecta is a web-based and CLI tool that simplifies inspecting Kafka messages and Zookeeper data. Additionally, the CLI tool provides th…☆216Updated 7 years ago
- Counting Twitter hashtags using Spark Streaming and Cassandra☆41Updated 10 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆184Updated 3 years ago
- Notes about Spark Streaming in Apache Spark☆60Updated 8 years ago
- An Apache Flume Sink implementation to publish data to Apache Kafka☆59Updated 10 years ago
- Storm primitives to allow out-of-band messaging to storm spouts and bolts.☆87Updated 5 years ago