apache / pig
Mirror of Apache Pig
☆682Updated 3 months ago
Alternatives and similar repositories for pig:
Users that are interested in pig are comparing it to the libraries listed below
- Mirror of Apache Oozie☆722Updated this week
- Apache Tez☆486Updated this week
- Mirror of Apache Sqoop☆977Updated 3 years ago
- Apache Phoenix☆1,030Updated this week
- Mirror of Apache Giraph☆619Updated last year
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,137Updated last year
- LinkedIn's previous generation Kafka to HDFS pipeline.☆876Updated 4 years ago
- Mirror of Apache Samza☆819Updated 3 weeks ago
- Mirror of Apache Knox☆193Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 2 years ago
- Kite SDK☆393Updated 2 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆549Updated 8 months ago
- Mirror of Apache Apex core☆349Updated 3 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆608Updated last year
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆625Updated last week
- Mirror of Apache Eagle☆409Updated 4 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆516Updated 5 years ago
- Apache Impala☆1,175Updated this week
- Contains the code used in the HBase: The Definitive Guide book.☆908Updated 2 years ago
- Mirror of Apache Hadoop HDFS☆197Updated 6 years ago
- Mirror of Apache Hadoop common☆159Updated 4 years ago
- Mirror of Apache Sentry☆120Updated 4 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆707Updated this week
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.☆417Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,349Updated last year
- Mirror of Apache Helix☆472Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,545Updated 3 months ago