Data and example code for Programming Pig, by Alan F. Gates
☆186Oct 15, 2016Updated 9 years ago
Alternatives and similar repositories for programmingpig
Users that are interested in programmingpig are comparing it to the libraries listed below.
- Collection of Pig scripts that I use for my talks and workshops☆39Apr 30, 2013Updated 13 years ago
- This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published b…☆23Apr 9, 2014Updated 12 years ago
- ☆44Jul 24, 2017Updated 8 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆581Jul 8, 2014Updated 11 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- ☆26Mar 18, 2016Updated 10 years ago
- Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.☆30Jun 29, 2014Updated 11 years ago
- Eclipse plugin for Apache Pig☆33Jul 22, 2013Updated 12 years ago
- Tools for analysing and visualising activity around Twitter backchannels☆26Nov 10, 2012Updated 13 years ago
- Apache Pig plugin for Eclipse☆12Feb 28, 2017Updated 9 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆50Jul 4, 2011Updated 14 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- SQL Windowing Functions for Hadoop☆65Jun 20, 2022Updated 3 years ago
- Android Live information coming from Twitter☆35Feb 6, 2014Updated 12 years ago
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Fraud Detection Online (Hadoop application)☆18Apr 8, 2014Updated 12 years ago
- Machine learning and natural language processing with Apache Pig☆53Dec 17, 2013Updated 12 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Piglet is a DSL for writing Pig scripts in Ruby☆83Jul 21, 2010Updated 15 years ago
- Tool to help users migrate large relational databases into Hadoop clusters.☆67Mar 23, 2012Updated 14 years ago
- A simple Scala Based Project Template for Apache Spark☆21Oct 21, 2016Updated 9 years ago
- Spider/Parser for gathering the election data from Russian Election Committee website☆16Aug 31, 2015Updated 10 years ago
- Apache Sqoop Cookbook☆36Dec 30, 2013Updated 12 years ago
- Spring Design Patterns and Best Practices [video], published by Packt☆13Jan 30, 2023Updated 3 years ago
- gathering point for open source OCR scripts and diffs☆43Jun 27, 2014Updated 11 years ago
- Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.☆26Jan 4, 2015Updated 11 years ago
- ☆10Nov 14, 2016Updated 9 years ago
- Fureteur is a simple, configurable, fault-tolerant web crawler written in Scala☆29Oct 14, 2014Updated 11 years ago
- All artifacts related to the Hortonworks Data Platform☆19Dec 16, 2022Updated 3 years ago
- Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.☆26Jan 8, 2015Updated 11 years ago
- Real-time analytics in Apache Flume☆51Feb 2, 2016Updated 10 years ago
- ☆195Jun 21, 2022Updated 3 years ago
- ☆23Nov 17, 2022Updated 3 years ago
- Java version of D.J. Bernstein's constant database (cdb) library.☆17Jan 30, 2026Updated 3 months ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Clojure core.async patterns☆12Jan 30, 2019Updated 7 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi☆44Jan 28, 2016Updated 10 years ago
- Source Code for 'Beginning Apache Spark 2' by Hien Luu☆17Sep 1, 2018Updated 7 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆37May 14, 2019Updated 6 years ago