kevinweil / hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
☆100 Updated 13 years ago
Alternatives and similar repositories for hadoop-lzo
Users who are interested in hadoop-lzo are comparing it to the libraries listed below.
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN. ☆418 Updated 2 years ago
- ☆558 Updated 3 years ago
- Example code for Kudu ☆77 Updated 6 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆553 Updated last year
- A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm ☆579 Updated 3 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces ☆316 Updated 3 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver… ☆344 Updated 8 years ago
- Kafka consumer emitting messages as storm tuples ☆104 Updated 4 years ago
- Hadoop Job for schemaless incremental loading of messages from Kafka topics onto hdfs with configurable output partitioning. ☆90 Updated 9 years ago
- Source code to accompany the book "Hadoop in Practice", published by Manning. ☆202 Updated 5 years ago
- GeoIP Functions for hive ☆48 Updated 5 years ago
- An Apache Flume Sink implementation to publish data to Apache Kafka ☆59 Updated 10 years ago
- Mirror of Apache Atlas (Incubating) ☆95 Updated 2 years ago
- Example programs and scripts for accessing parquet files ☆30 Updated 7 years ago
- Mirror of Apache Slider ☆77 Updated 6 years ago
- This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one ☆561 Updated 3 years ago
- SparkOnHBase ☆279 Updated 4 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive ☆287 Updated 9 years ago
- Kafka as Hive Storage ☆66 Updated 11 years ago
- Remedy small files by combining them into larger ones. ☆194 Updated 3 years ago
- A streaming / online query processing / analytics engine based on Apache Storm ☆273 Updated 8 years ago
- ElasticSearch integration for Apache Spark ☆47 Updated 9 years ago
- A Maven-based example of using Cloudera Impala's JDBC driver ☆118 Updated 9 years ago
- A simple storm performance/stress test ☆74 Updated 2 years ago
- ☆243 Updated 7 years ago
- Mirror of Apache Sentry ☆34 Updated 6 years ago
- Kite SDK ☆393 Updated 3 years ago
- Plugins for Azkaban. ☆130 Updated 7 years ago
- Hannibal is a tool to help monitor and maintain HBase-Clusters that are configured for manual splitting. ☆172 Updated 7 years ago
- Companion Code for Using Flume Book ☆32 Updated 10 years ago