Data and example code for Programming Pig, by Alan F. Gates
☆186Oct 15, 2016Updated 9 years ago
Alternatives and similar repositories for programmingpig
Users that are interested in programmingpig are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collection of Pig scripts that I use for my talks and workshops☆39Apr 30, 2013Updated 13 years ago
- ☆44Jul 24, 2017Updated 8 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆581Jul 8, 2014Updated 11 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.☆30Jun 29, 2014Updated 12 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Eclipse plugin for Apache Pig☆33Jul 22, 2013Updated 12 years ago
- Tools for analysing and visualising activity around Twitter backchannels☆26Nov 10, 2012Updated 13 years ago
- Apache Pig plugin for Eclipse☆12Feb 28, 2017Updated 9 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆51Jul 4, 2011Updated 15 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- Examples of use of pig scripting languages capabilities☆39Aug 1, 2016Updated 9 years ago
- Provides canary based prewarming of lambda functions for Kinesis Event Sources.☆15Oct 13, 2020Updated 5 years ago
- A generator for synthetic streams of financial transactions.☆16Feb 3, 2014Updated 12 years ago
- Python Client for WebHDFS REST API☆43May 8, 2015Updated 11 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A virtual Apache Zookeper cluster with Vagrant, VirtualBox and Ansible☆20Aug 16, 2016Updated 9 years ago
- Machine learning and natural language processing with Apache Pig☆53Dec 17, 2013Updated 12 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Piglet is a DSL for writing Pig scripts in Ruby☆83Jul 21, 2010Updated 15 years ago
- A simple Scala Based Project Template for Apache Spark☆21Oct 21, 2016Updated 9 years ago
- useful JVM classes for the mrjob hadoop streaming framework☆31Jun 20, 2013Updated 13 years ago
- Spider/Parser for gathering the election data from Russian Election Committee website☆16Aug 31, 2015Updated 10 years ago
- Apache Sqoop Cookbook☆36Dec 30, 2013Updated 12 years ago
- Asakusa Framework Examples☆24Jan 7, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- diff large files without running out of memory; only unified format; probably buggy, but ~no memory usage☆14Mar 6, 2014Updated 12 years ago
- Introductory sample scala app using Apache Spark Streaming to accept data from Kafka and write a summary to Cassandra.☆22Dec 5, 2018Updated 7 years ago
- gathering point for open source OCR scripts and diffs☆43Jun 27, 2014Updated 12 years ago
- Functional testing framework for Big Data pipelines.☆59Jul 6, 2023Updated 2 years ago
- All artifacts related to the Hortonworks Data Platform☆19Dec 16, 2022Updated 3 years ago
- Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.☆26Jan 8, 2015Updated 11 years ago
- ☆194Jun 21, 2022Updated 4 years ago
- ☆23Nov 17, 2022Updated 3 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi☆44Jan 28, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A JRuby DSL for Cascading☆41Sep 23, 2015Updated 10 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆37May 14, 2019Updated 7 years ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- Experimental: Multi-producer Single-consumer Queue☆12Jul 30, 2012Updated 13 years ago
- Taller SparkR para las Jornadas de Usuarios de R☆13Nov 21, 2016Updated 9 years ago
- This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.☆57Jun 10, 2018Updated 8 years ago
- NCPR storage and api☆22Mar 30, 2018Updated 8 years ago