twitter / hadoop-lzoView external linksLinks
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
☆551Apr 24, 2024Updated last year
Alternatives and similar repositories for hadoop-lzo
Users that are interested in hadoop-lzo are comparing it to the libraries listed below
Sorting:
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,132Apr 10, 2023Updated 2 years ago
- Elephant Twin is a framework for creating indexes in Hadoop☆98Oct 12, 2020Updated 5 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆37Aug 13, 2012Updated 13 years ago
- Splittable Gzip codec for Hadoop☆74Dec 12, 2025Updated 2 months ago
- Snappy compression for Hadoop☆40Jun 18, 2015Updated 10 years ago
- Mirror of Apache Oozie☆727Jan 27, 2025Updated last year
- This source can record the position of file if the flume application has been killed,it also know which line should be read from next tim…☆19Jan 9, 2017Updated 9 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Jan 15, 2026Updated last month
- REST job server for Apache Spark☆2,845Jul 8, 2025Updated 7 months ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆241Jan 8, 2016Updated 10 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 8 years ago
- Visualize your HDFS cluster usage☆228Oct 13, 2020Updated 5 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format☆127Jan 14, 2022Updated 4 years ago
- Metrics produced to Kafka and consumers for monitoring them☆102Jan 10, 2015Updated 11 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.☆90Apr 11, 2013Updated 12 years ago
- A C interface to watchman☆47Jun 7, 2019Updated 6 years ago
- Apache Phoenix☆1,050Updated this week
- DEPRECATED☆18Sep 12, 2018Updated 7 years ago
- Sample Python code for working with the HBase REST interface☆24Jul 25, 2013Updated 12 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,371Aug 22, 2023Updated 2 years ago
- Open source SQL Query Assistant service for Databases/Warehouses☆1,466Updated this week
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆763Feb 8, 2026Updated last week
- A Scala API for Cascading☆3,525May 28, 2023Updated 2 years ago
- StatHat API Wrapper.☆34Feb 4, 2016Updated 10 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf☆14Apr 25, 2012Updated 13 years ago
- Utilities to allow Angular templates to use Node.bind()☆26Mar 12, 2020Updated 5 years ago
- Gather statistics about the jvm garbage collection and push into ganglia☆32Feb 18, 2010Updated 15 years ago
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆1,032Updated this week
- Apache Hive☆6,007Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala☆34Dec 27, 2022Updated 3 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,032Jan 9, 2018Updated 8 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,609Updated this week
- Cloudera Manager API Client☆308Dec 17, 2023Updated 2 years ago
- Scala implementations of standard algorithms for Multi-Armed Bandits Problem.☆12May 7, 2016Updated 9 years ago
- Solr on YARN prototype☆18Nov 14, 2014Updated 11 years ago
- Tracking events, CfPs, abstracts, slides, and all other even related things☆22Oct 4, 2019Updated 6 years ago