hanborq / hadoop
A Hanborq-optimized Hadoop distribution, focused on high-performance MapReduce. It is the core part of HDH (Hanborq Distribution with Hadoop for Big Data Engineering).
☆49 Updated 12 years ago
Alternatives and similar repositories for hadoop:
Users who are interested in hadoop are comparing it to the libraries listed below.
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time-based directory paths ☆42 Updated 8 years ago
- Mirror of Apache HCatalog ☆60 Updated last year
- SQL interface to Druid. ☆77 Updated 9 years ago
- Kafka as Hive Storage ☆66 Updated 10 years ago
- Apache Tephra: Transactions for HBase. ☆157 Updated 5 months ago
- A plugin for Flume that allows you to use Cassandra as a sink. ☆59 Updated 13 years ago
- Transactional Support for HBase (Mirror of https://github.com/apache/incubator-omid) ☆300 Updated 7 years ago
- Mirror of Apache Spark ☆57 Updated 9 years ago
- MySQL-like queries for Druid built on top of Plywood ☆147 Updated 5 years ago
- Extensions, custom & experimental panels ☆52 Updated 9 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop ☆243 Updated 9 years ago
- Hadoop log aggregator and dashboard ☆191 Updated 11 years ago
- Oozie - workflow engine for Hadoop ☆373 Updated 7 years ago
- DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312) ☆23 Updated 7 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format. ☆34 Updated 11 years ago
- Crux is a reporting application for HBase. Crux provides a simple web-based graphical interface to access HBase, query data and create re… ☆100 Updated 11 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc. ☆76 Updated 10 years ago
- Docker containers for Druid nodes ☆27 Updated 8 years ago
- Mirror of Apache Blur ☆33 Updated 6 years ago
- Low level integration of Spark and Kafka ☆130 Updated 6 years ago
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting. ☆172 Updated 7 years ago
- A simple Storm performance/stress test ☆74 Updated last year
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 7 years ago
- HBase as the backing store for the TF-IDF representations for Lucene ☆108 Updated 14 years ago
- Hive + Avro. Serde for working with Avro in Hive ☆59 Updated last year
- Hadoop Data Integration with various databases, FTP servers, Salesforce. Incremental update, dedup, append, merge your data on Hadoop. ☆91 Updated 11 years ago
- Metrics produced to Kafka and consumers for monitoring them ☆100 Updated 10 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet. ☆331 Updated 9 years ago
- A Scala client library for ZooKeeper (DEPRECATED) ☆119 Updated 11 years ago
- ☆54 Updated 10 years ago