Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,133 · Apr 10, 2023 · Updated 2 years ago
Alternatives and similar repositories for elephant-bird
Users who are interested in elephant-bird are comparing it to the libraries listed below.
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆551 · Apr 24, 2024 · Updated last year
- Hadoop library for large-scale data processing, now an Apache Incubator project ☆582 · Jul 8, 2014 · Updated 11 years ago
- Oozie - workflow engine for Hadoop ☆374 · Jun 8, 2017 · Updated 8 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc. ☆76 · Mar 31, 2014 · Updated 11 years ago
- A Scala API for Cascading ☆3,523 · May 28, 2023 · Updated 2 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,261 · Feb 27, 2026 · Updated last week
- Elephant Twin is a framework for creating indexes in Hadoop ☆98 · Oct 12, 2020 · Updated 5 years ago
- Embedded Kafka for testing and quick prototyping. ☆14 · Apr 19, 2016 · Updated 9 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more ☆8,792 · Aug 16, 2017 · Updated 8 years ago
- Distributed database specialized in exporting key/value data from Hadoop ☆558 · Jun 27, 2014 · Updated 11 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,371 · Aug 22, 2023 · Updated 2 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link. ☆84 · Aug 21, 2014 · Updated 11 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client. ☆609 · May 19, 2023 · Updated 2 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive