hanborq / hadoopLinks
A Hanborq optimized Hadoop Distribution, especially with high performance of MapReduce. It's the core part of HDH (Hanborq Distribution with Hadoop for Big Data Engineering).
☆50Updated 13 years ago
Alternatives and similar repositories for hadoop
Users that are interested in hadoop are comparing it to the libraries listed below
Sorting:
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 9 years ago
- Transactional Support for HBase (Mirror of https://github.com/apache/incubator-omid)☆299Updated 8 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆116Updated 10 years ago
- Apache Tephra: Transactions for HBase.☆158Updated last year
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop☆244Updated 10 years ago
- Sql interface to druid.☆77Updated 10 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 3 years ago
- Bitmap compression using the CONCISE algorithm☆43Updated 8 years ago
- A plugin for flume that allows you to use Cassandra as a sink.☆59Updated 14 years ago
- A streaming / online query processing / analytics engine based on Apache Storm☆273Updated 8 years ago
- Multidimensional data storage with rollups for numerical data☆268Updated 3 months ago
- Real²time Exploratory Analytics on Large Datasets☆121Updated 6 years ago
- Tail a log file and send log lines automatically to a kafka topic☆57Updated 13 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 10 years ago
- High Throughput Real-time Stream Processing Framework☆285Updated 8 years ago
- Norbert is a cluster manager and networking layer built on top of Zookeeper.☆388Updated 3 years ago
- ☆92Updated 8 years ago
- Metrics produced to Kafka and consumers for monitoring them☆102Updated 11 years ago
- ☆40Updated 10 years ago
- A keen Observer of changes that can also relay change events reliably to interested parties. Provides useful infrastructure for building …☆24Updated 2 years ago
- Mirror of Apache Spark☆56Updated 10 years ago
- Storm on Mesos!☆138Updated 4 years ago
- Mahout vector encoding for pig☆54Updated 3 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 12 years ago
- DEPRECATED—Open source Apache Cassandra running on DC/OS is now replaced by mesosphere/dcos-commons/frameworks/cassandra. This repositor…☆116Updated 6 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- An Apache Storm IMetricsConsumer that forwards Storm's built-in metrics to a Graphite server for real-time graphing, visualization, and o…☆76Updated 2 years ago
- Docker containers for Druid nodes☆27Updated 9 years ago
- The DB that's replicated, sharded and transactional.☆175Updated 10 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 9 years ago