alanfgates / programmingpig
Data and example code for Programming Pig, by Alan F. Gates
☆188 Updated 8 years ago
Alternatives and similar repositories for programmingpig:
Users who are interested in programmingpig are comparing it to the repositories listed below.
- Examples for learning spark ☆332 Updated 9 years ago
- Source, data and tutorials of the blog post video series of Hue, the Web UI for Hadoop. ☆238 Updated 8 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop ☆54 Updated 9 years ago
- Training materials for Strata, AMP Camp, etc ☆149 Updated 9 years ago
- Coding exercises for Apache Spark ☆104 Updated 9 years ago
- Oozie Samples ☆52 Updated 11 years ago
- Code repository for O'Reilly Hadoop Application Architectures book ☆165 Updated 9 years ago
- Gallery of Apache Zeppelin notebooks ☆215 Updated 5 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive ☆286 Updated 8 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB ☆94 Updated 4 years ago
- ☆24 Updated 9 years ago
- Simple Spark Application ☆76 Updated last year
- Elastic Search on Spark ☆112 Updated 10 years ago
- Examples for Apache Oozie book ☆18 Updated 8 years ago
- ☆76 Updated 9 years ago
- Utility to easily copy files into HDFS ☆69 Updated 5 years ago
- HDP Data Science/Machine Learning demo ☆37 Updated 9 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver… ☆344 Updated 7 years ago
- Example of use of Spark Streaming with Kafka ☆90 Updated 10 years ago
- ☆54 Updated 10 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR ☆34 Updated 8 years ago
- Reference Architectures for Apache Spark ☆38 Updated 8 years ago
- Code examples and docker environment for Spark ☆27 Updated 9 years ago
- Self-written notes that may be useful ☆108 Updated 9 years ago
- Source code for Big Data: Principles and best practices of scalable realtime data systems ☆332 Updated 10 months ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1 ☆126 Updated 9 years ago
- Examples for High Performance Spark ☆508 Updated 5 months ago
- A stack overflow for Apache Spark ☆72 Updated 7 years ago
- Large scale query engine benchmark ☆99 Updated 9 years ago
- Source code to accompany the book "Hadoop in Practice", published by Manning. ☆202 Updated 5 years ago