apache/pig

apache / pig

Mirror of Apache Pig

☆687

Alternatives and similar repositories for pig

Users who are interested in pig are comparing it to the libraries listed below.

apache / hive
Apache Hive
☆5,994 · Updated this week
apache / oozie
Mirror of Apache Oozie
☆729 · Jan 27, 2025 · Updated last year
apache / sqoop
Mirror of Apache Sqoop
☆974 · Apr 8, 2021 · Updated 5 years ago
apache / tez
Apache Tez
☆516 · Updated this week
apache / logging-flume
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…
☆2,566 · Jul 10, 2026 · Updated last week
apache / storm
Apache Storm
☆6,692 · Updated this week
apache / hbase
Apache HBase
☆5,548 · Updated this week
apache / mahout
Apache Mahout - an environment for quickly creating scalable, performant machine learning applications.
☆2,298 · Updated this week
apache / hadoop-hdfs
Mirror of Apache Hadoop HDFS
☆202 · Dec 10, 2018 · Updated 7 years ago
apache / hcatalog
Mirror of Apache HCatalog
☆59 · Apr 14, 2023 · Updated 3 years ago
apache / hadoop-mapreduce
Mirror of Apache Hadoop MapReduce
☆115 · Oct 27, 2019 · Updated 6 years ago
apache / chukwa
Mirror of Apache Chukwa
☆85 · Mar 31, 2019 · Updated 7 years ago
apache / hadoop
Apache Hadoop
☆15,614 · Updated this week
apache / avro
Apache Avro is a data serialization system.
☆3,288 · Updated this week
apache / zookeeper
Apache ZooKeeper
☆12,782 · Updated this week
apache / cassandra
Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructu…
☆9,866 · Updated this week
apache / crunch
Mirror of Apache Crunch (Incubating)
☆110 · Feb 2, 2021 · Updated 5 years ago
apache / giraph
Mirror of Apache Giraph
☆620 · Apr 14, 2023 · Updated 3 years ago
apache / mesos
Apache Mesos
☆5,368 · May 15, 2026 · Updated 2 months ago
apache / datafu
Mirror of Apache DataFu
☆124 · Jul 9, 2026 · Updated last week
apache / drill
Apache Drill is a distributed MPP query layer for self describing data
☆2,020 · Updated this week
apache / kafka
Apache Kafka - A distributed event streaming platform
☆33,286 · Updated this week
apache / samza
Mirror of Apache Samza
☆846 · May 15, 2026 · Updated 2 months ago
apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
☆43,658 · Updated this week
apache / thrift
Apache Thrift
☆10,938 · Updated this week
apache / ant
Apache Ant is a Java-based build tool.
☆467 · Updated this week
apache / impala
Apache Impala
☆1,279 · Updated this week
apache / activemq
Apache ActiveMQ
☆2,442 · Updated this week
apache / phoenix
Apache Phoenix
☆1,060 · Updated this week
apache / ambari
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.
☆2,306 · Updated this week
cloudera / Impala
Real-time Query for Hadoop; mirror of Apache Impala
☆34 · Dec 27, 2022 · Updated 3 years ago
apache / hadoop-common
Mirror of Apache Hadoop common
☆162 · Mar 4, 2020 · Updated 6 years ago
sigmoidanalytics / spork
Pig on Apache Spark
☆82 · Mar 23, 2015 · Updated 11 years ago
spring-attic / spring-hadoop
Spring for Apache Hadoop is a framework for application developers to take advantage of the features of both Hadoop and Spring.
☆620 · Apr 4, 2022 · Updated 4 years ago
apache / couchdb
Seamless multi-primary syncing database with an intuitive HTTP/JSON API, designed for reliability
☆6,928 · Updated this week
cloudera / flume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…
☆943 · May 26, 2021 · Updated 5 years ago
apache / falcon
Mirror of Apache Falcon
☆104 · Mar 7, 2019 · Updated 7 years ago
cloudera / hue
Open source SQL Query Assistant service for Databases/Warehouses
☆1,413 · Updated this week
apache / nutch
Apache Nutch is an extensible and scalable web crawler
☆3,262 · Updated this week