alanfgates / programmingpig
Data and example code for Programming Pig, by Alan F. Gates
☆188Updated 8 years ago
Alternatives and similar repositories for programmingpig:
Users who are interested in programmingpig are comparing it to the libraries listed below
- Examples for learning spark☆332Updated 9 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆165Updated 9 years ago
- Source, data and tutorials of the blog post video series of Hue, the Web UI for Hadoop.☆237Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- Simple Spark Application☆76Updated last year
- Collection of Pig scripts that I use for my talks and workshops☆40Updated 11 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- Gallery of Apache Zeppelin notebooks☆215Updated 5 years ago
- ☆24Updated 9 years ago
- Oozie Samples☆52Updated 11 years ago
- Examples for Apache Oozie book☆18Updated 8 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive☆287Updated 8 years ago
- Coding exercises for Apache Spark☆104Updated 9 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 8 years ago
- Utility to easily copy files into HDFS☆69Updated 5 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Updated 7 years ago
- Source code for Big Data: Principles and best practices of scalable realtime data systems☆332Updated 9 months ago
- Large scale query engine benchmark☆99Updated 8 years ago
- ☆23Updated 8 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- Code examples and docker environment for Spark☆27Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Visualize streaming machine learning in Spark☆176Updated 7 years ago
- A free electronic book about Apache Hive. The book is geared towards SQL-knowledgeable business users with some advanced tips for devops.…☆103Updated 7 years ago
- SQL Windowing Functions for Hadoop☆65Updated 2 years ago
- Self-written notes that may be useful☆107Updated 9 years ago
- Apache Sqoop Cookbook☆36Updated 11 years ago
- ☆76Updated 9 years ago
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago