Data and example code for Programming Pig, by Alan F. Gates
☆186Oct 15, 2016Updated 9 years ago
Alternatives and similar repositories for programmingpig
Users that are interested in programmingpig are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collection of Pig scripts that I use for my talks and workshops☆39Apr 30, 2013Updated 12 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆582Jul 8, 2014Updated 11 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- ☆26Mar 18, 2016Updated 10 years ago
- Eclipse plugin for Apache Pig☆33Jul 22, 2013Updated 12 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Apache Pig plugin for Eclipse☆12Feb 28, 2017Updated 9 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆50Jul 4, 2011Updated 14 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Python Client for WebHDFS REST API☆43May 8, 2015Updated 10 years ago
- Fraud Detection Online (Hadoop application)☆18Apr 8, 2014Updated 12 years ago
- Machine learning and natural language processing with Apache Pig☆53Dec 17, 2013Updated 12 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Piglet is a DSL for writing Pig scripts in Ruby☆83Jul 21, 2010Updated 15 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple Scala Based Project Template for Apache Spark☆21Oct 21, 2016Updated 9 years ago
- A content-filtering bypass system developed specifically to allow access to trans-related resources on public networks (libraries, school…☆27Nov 15, 2014Updated 11 years ago
- Spider/Parser for gathering the election data from Russian Election Committee website☆16Aug 31, 2015Updated 10 years ago
- Apache Sqoop Cookbook☆36Dec 30, 2013Updated 12 years ago
- Monorepo for pi packages: TUI library, agent framework, and pod management CLI☆46Dec 10, 2025Updated 4 months ago
- Asakusa Framework Examples☆24Jan 7, 2021Updated 5 years ago
- Code samples for the book☆39Sep 10, 2013Updated 12 years ago
- Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.☆26Jan 4, 2015Updated 11 years ago
- Functional testing framework for Big Data pipelines.☆60Jul 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- All artifacts related to the Hortonworks Data Platform☆19Dec 16, 2022Updated 3 years ago
- A repository for all code generated at our Datadive events☆36May 12, 2012Updated 13 years ago
- ☆13Sep 18, 2025Updated 6 months ago
- ☆195Jun 21, 2022Updated 3 years ago
- Java version of D.J. Bernstein's constant database (cdb) library.☆17Jan 30, 2026Updated 2 months ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi☆44Jan 28, 2016Updated 10 years ago
- This application "listens" for a ticket creation event from Zendesk, analyses the ticket for negative sentiment, tags the ticket accordin…☆14Mar 10, 2025Updated last year
- ☆11Feb 13, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The source code for my PyCon 2017 talk "5 ways to deploy you Python web app in 2017"☆10May 19, 2017Updated 8 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38May 14, 2019Updated 6 years ago
- Spark Streaming HBase Example☆22Mar 16, 2016Updated 10 years ago
- Examples for using Amazon SageMaker Operators for Kubernetes☆12Mar 4, 2020Updated 6 years ago
- A demo project that replicates a Spring Batch tutorial using Apache Camel within a Spring Boot app☆14Apr 21, 2019Updated 6 years ago
- ☆28Jul 13, 2016Updated 9 years ago
- read and write JSON-stat with R☆32Sep 4, 2023Updated 2 years ago