ptgoetz / storm-hdfs
Storm components for interacting with HDFS file systems
☆60, updated 8 years ago
Alternatives and similar repositories for storm-hdfs
Users who are interested in storm-hdfs are comparing it to the libraries listed below.
- Kafka consumer emitting messages as Storm tuples ☆104, updated 4 years ago
- Storm primitives to allow out-of-band messaging to Storm spouts and bolts. ☆87, updated 5 years ago
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN. ☆418, updated 2 years ago
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆883, updated 5 years ago
- A streaming / online query processing / analytics engine based on Apache Storm ☆273, updated 8 years ago
- Hadoop job for schemaless incremental loading of messages from Kafka topics onto HDFS with configurable output partitioning. ☆90, updated 8 years ago
- Native, optimized access to HBase data through Spark SQL/DataFrame interfaces ☆319, updated 3 years ago
- An HBase connector for Storm ☆117, updated 12 years ago
- High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.… ☆633, updated 3 years ago
- SparkOnHBase ☆279, updated 4 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover… ☆515, updated 5 years ago
- [PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streamin… ☆724, updated 3 years ago
- Mirror of Apache Atlas (Incubating) ☆95, updated 2 years ago
- This code base is retained for historical interest only; please visit the Apache Incubator repo for the latest one ☆561, updated 2 years ago
- Kafka as Hive Storage ☆66, updated 10 years ago
- ☆243, updated 7 years ago
- Remedy small files by combining them into larger ones. ☆194, updated 3 years ago
- A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm ☆579, updated 3 years ago
- SQL interface to Druid. ☆77, updated 9 years ago
- An Apache Flume Sink implementation to publish data to Apache Kafka ☆59, updated 10 years ago
- Secondary Index for HBase ☆592, updated 8 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit… ☆282, updated 7 years ago
- A practical Storm Trident tutorial ☆122, updated last year
- Connect Spark to HBase for reading and writing data with ease ☆297, updated 7 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver… ☆344, updated 8 years ago
- A Maven-based example of using Cloudera Impala's JDBC driver ☆118, updated 9 years ago
- Example programs and scripts for accessing Parquet files ☆30, updated 7 years ago
- Kafka Ganglia Metrics Reporter ☆39, updated last year
- Spark RDD to read, write and delete from HBase ☆276, updated 4 years ago
- ☆128, updated 6 years ago