apache / pig
Mirror of Apache Pig
☆688 Updated 7 months ago
Alternatives and similar repositories for pig
Users who are interested in pig are comparing it to the libraries listed below.
- Mirror of Apache Oozie ☆722 Updated 4 months ago
- Mirror of Apache Sqoop ☆982 Updated 4 years ago
- Apache Phoenix ☆1,039 Updated last week
- Apache Tez ☆494 Updated 2 weeks ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 Updated 2 years ago
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆878 Updated 4 years ago
- Mirror of Apache Samza ☆826 Updated 3 weeks ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆550 Updated last year
- Mirror of Apache Hadoop common ☆159 Updated 5 years ago
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l… ☆2,553 Updated 7 months ago
- Mirror of Apache Giraph ☆618 Updated 2 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. ☆1,137 Updated 2 years ago
- Mirror of Apache Apex core ☆348 Updated 3 years ago
- Kite SDK ☆394 Updated 2 years ago
- Mirror of Apache Hadoop HDFS ☆199 Updated 6 years ago
- Mirror of Apache Eagle ☆410 Updated 4 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover… ☆516 Updated 5 years ago
- Contains the code used in the HBase: The Definitive Guide book. ☆909 Updated 2 years ago
- Mirror of Apache Sentry ☆119 Updated 4 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 Updated 2 years ago
- High performance data store solution ☆1,435 Updated 2 months ago
- MongoDB Connector for Hadoop ☆1,518 Updated 3 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,357 Updated last year
- Secondary Index for HBase ☆593 Updated 8 years ago
- Oozie - workflow engine for Hadoop ☆373 Updated 7 years ago
- Apache Drill is a distributed MPP query layer for self describing data ☆1,974 Updated this week
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ... ☆640 Updated last year
- Mirror of Apache Knox ☆198 Updated this week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆913 Updated last week
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN. ☆416 Updated last year