kevinweil / hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for Hadoop 0.20
☆100 Updated 14 years ago
Alternatives and similar repositories for hadoop-lzo
Users who are interested in hadoop-lzo are comparing it to the repositories listed below.
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN. ☆418 Updated 2 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver… ☆345 Updated 8 years ago
- A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm ☆580 Updated 3 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for Hadoop 0.20 ☆551 Updated last year
- Hadoop job for schemaless incremental loading of messages from Kafka topics onto HDFS with configurable output partitioning. ☆90 Updated 9 years ago
- Source code to accompany the book "Hadoop in Practice", published by Manning. ☆203 Updated 5 years ago
- Kafka consumer emitting messages as Storm tuples ☆105 Updated 5 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces ☆316 Updated 3 years ago
- SparkOnHBase ☆279 Updated 4 years ago
- An Apache Flume Sink implementation to publish data to Apache Kafka ☆59 Updated 10 years ago
- GeoIP functions for Hive ☆48 Updated 5 years ago
- Kite SDK ☆393 Updated 3 years ago
- Example code for Kudu ☆77 Updated 6 years ago
- Transactional and indexing extensions for HBase ☆73 Updated 14 years ago
- Mirror of Apache Sentry ☆34 Updated 6 years ago
- A streaming / online query processing / analytics engine based on Apache Storm ☆273 Updated 8 years ago
- Example programs and scripts for accessing parquet files ☆30 Updated 7 years ago
- High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.… ☆634 Updated 3 years ago
- This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one ☆561 Updated 3 years ago
- ☆557 Updated 3 years ago
- ElasticSearch integration for Apache Spark ☆47 Updated 9 years ago
- An HBase connector for Storm ☆117 Updated 12 years ago
- ☆243 Updated 7 years ago
- Mirror of Apache Atlas (Incubating) ☆95 Updated 2 years ago
- Code repository for O'Reilly Hadoop Application Architectures book ☆164 Updated 10 years ago
- Spark Summit 2017 San Francisco ☆96 Updated 8 years ago
- ☆56 Updated 11 years ago
- Remedy small files by combining them into larger ones. ☆195 Updated 3 years ago
- Mirror of Apache Slider ☆77 Updated 7 years ago
- Trident-ML: A real-time online machine learning library ☆384 Updated 2 years ago