sigmoidanalytics/spork

Pig on Apache Spark

☆82

Alternatives and similar repositories for spork

Users that are interested in spork are comparing it to the libraries listed below.

databricks / pig-on-spark
proof-of-concept implementation of Pig-on-Spark integrated at the logical node level
☆29 · Jul 7, 2022 · Updated 4 years ago
miguno / avro-hadoop-starter
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
☆115 · Nov 12, 2015 · Updated 10 years ago
tdunning / pig-vector
Mahout vector encoding for pig
☆53 · Nov 20, 2022 · Updated 3 years ago
elodina / syscol
Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka
☆20 · Jan 29, 2016 · Updated 10 years ago
julienledem / Pig-scripting-examples
Examples of using Pig's scripting language capabilities
☆39 · Aug 1, 2016 · Updated 9 years ago
LinkedInAttic / Cubert
Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop
☆245 · Aug 24, 2015 · Updated 10 years ago
aredee / accumulo-mesos
☆17 · Oct 27, 2015 · Updated 10 years ago
dvryaboy / pig
Mirror of Apache Pig
☆18 · Jul 9, 2013 · Updated 13 years ago
Netflix / Lipstick
Pig Visualization framework
☆466 · Mar 24, 2023 · Updated 3 years ago
ipedrazas / Zeppelin-docker
Dockerfile for Apache Zeppelin
☆17 · Dec 9, 2015 · Updated 10 years ago
mesos / myriad
https://github.com/apache/incubator-myriad is our new home. See
☆251 · Dec 2, 2015 · Updated 10 years ago
matthayes / sublime-text-pig
Package for Apache Pig support in Sublime Text 2
☆16 · Apr 16, 2012 · Updated 14 years ago
brightcove-archive / ooyala_spark-jobserver
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…
☆345 · May 19, 2017 · Updated 9 years ago
greenplum-db / PivotalR-archive
A convenient R tool for manipulating tables in PostgreSQL-type databases and a wrapper for Apache MADlib.
☆127 · Nov 2, 2022 · Updated 3 years ago
edwardcapriolo / filecrush
Remedy small files by combining them into larger ones.
☆196 · Jul 1, 2022 · Updated 4 years ago
avulanov / ann-benchmark
Benchmarks of artificial neural network library for Spark MLlib
☆11 · Dec 3, 2015 · Updated 10 years ago
LinkedInAttic / datafu
Hadoop library for large-scale data processing, now an Apache Incubator project
☆581 · Jul 8, 2014 · Updated 12 years ago
mmay / PigJsonLoader
A Load UDF for loading JSON files with Pig
☆15 · Jul 6, 2011 · Updated 15 years ago
jjallaire / TBB
Intel TBB Package for R/Rcpp
☆15 · Jul 7, 2014 · Updated 12 years ago
alienrobotwizard / sounder
A grouping of Apache Pig examples.
☆65 · Oct 13, 2020 · Updated 5 years ago
twitter / elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134 · Apr 10, 2023 · Updated 3 years ago
RevolutionAnalytics / dplyr-spark
Spark backend for dplyr
☆47 · Dec 30, 2015 · Updated 10 years ago
PacktPublishing / Mastering-Mesos
Mastering Mesos by Packt Publishing
☆12 · Jan 30, 2023 · Updated 3 years ago
onetapbeyond / opencpu-spark-executor
Apache Spark OpenCPU Executor (ROSE)
☆25 · Jun 16, 2018 · Updated 8 years ago
wilbur / Piggybank
A repository of user-defined functions for Apache Pig
☆16 · Sep 20, 2010 · Updated 15 years ago
shivajid / HortonworksOperationsWorkshop
☆14 · Oct 14, 2015 · Updated 10 years ago
laserson / dsq
Distributed Streaming Quantiles (for PySpark)
☆38 · Jan 30, 2014 · Updated 12 years ago
wesleypeck / parquet-tools
Command line tools for the parquet project
☆44 · Jul 10, 2018 · Updated 8 years ago
shravanpn7 / AWS-Cleanup
These scripts clean up unused EBS volumes, AMIs and snapshots on Amazon Web Services.
☆11 · Jul 24, 2015 · Updated 11 years ago
mstump / golang-driver
Golang wrapper of the DataStax/Cassandra C++ driver
☆25 · May 28, 2019 · Updated 7 years ago
apache / pig
Mirror of Apache Pig
☆687 · May 15, 2026 · Updated 2 months ago
dbis-ilm / piglet
A compiler for Pig Latin to Spark and Flink.
☆24 · Nov 21, 2019 · Updated 6 years ago
cchantep / foorgol
Google API client (or on the Discworld, the Ephebian God of Avalanches).
☆16 · Jun 16, 2026 · Updated last month
Banno / samza-mesos
This project allows running Samza jobs on a Mesos cluster
☆43 · Mar 25, 2021 · Updated 5 years ago
hougs / scala-dataflow-dsl
A Scala DSL for dataflow
☆11 · Dec 31, 2014 · Updated 11 years ago
amplab / benchmark
Large scale query engine benchmark
☆99 · Apr 5, 2016 · Updated 10 years ago
lucidworks / yarn-proto
Solr on YARN prototype
☆18 · Nov 14, 2014 · Updated 11 years ago
madlib / archived_madlib
MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.
☆508 · Feb 9, 2018 · Updated 8 years ago
mozilla-metrics / akela
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.
☆77 · Mar 31, 2014 · Updated 12 years ago