alanfgates / programmingpig
Data and example code for Programming Pig, by Alan F. Gates
☆187Updated 8 years ago
Alternatives and similar repositories for programmingpig
Users who are interested in programmingpig are comparing it to the libraries listed below
- Source, data and tutorials of the blog post video series of Hue, the Web UI for Hadoop.☆236Updated 8 years ago
- Source code for Big Data: Principles and best practices of scalable realtime data systems☆332Updated last year
- Examples for learning spark☆331Updated 9 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆165Updated 10 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive☆287Updated 8 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆348Updated 4 years ago
- ☆76Updated 10 years ago
- Oozie Samples☆52Updated 11 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 10 years ago
- Self-written notes that may be useful☆107Updated 9 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 9 years ago
- Apache Spark™ and Scala Workshops☆264Updated 11 months ago
- BerkeleyX: CS100.1x, Introduction to Big Data with Apache Spark☆12Updated 9 years ago
- Simple Spark Application☆76Updated last year
- Real-Time Analytics with Storm☆79Updated 3 years ago
- Collection of Pig scripts that I use for my talks and workshops☆39Updated 12 years ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1☆126Updated 9 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 9 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Updated 7 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Repository for MapReduce Design Patterns (O'Reilly 2012) example source code☆235Updated 10 years ago
- Practical examples of using Apache Spark in several different use cases☆102Updated 9 years ago
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago
- Pig on Apache Spark☆83Updated 10 years ago
- Elastic Search on Spark☆112Updated 10 years ago