pradeep-pasupuleti / pig-design-patterns
This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published by Packt.
☆23Updated 10 years ago
Related projects: ⓘ
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- Kite SDK Examples☆99Updated 3 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 4 years ago
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆39Updated 8 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 4 years ago
- Tools for Hadoop☆25Updated 12 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 7 years ago
- ☆48Updated 8 years ago
- ☆24Updated this week
- Demo for Kafka Connect with JDBC and HDFS Connectors☆0Updated 3 months ago
- ☆49Updated this week
- An Ambari Stack service package for VNC Server with the ability to install developer tools like Eclipse/IntelliJ/Maven as well to 'remote…☆28Updated 8 years ago
- Utility to easily copy files into HDFS☆69Updated 4 years ago
- ☆78Updated 9 years ago
- Coding exercises for Apache Spark☆103Updated 9 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 9 years ago
- HDP Data Science/Machine Learning demo☆37Updated 9 years ago
- ☆63Updated this week
- Training materials for Strata, AMP Camp, etc☆150Updated 8 years ago
- ☆33Updated this week
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38Updated 5 years ago
- Simple Spark Application☆76Updated 9 months ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 3 years ago
- Materials for various Hadoop & Nifi related workshops☆49Updated 5 years ago
- An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project…☆29Updated 8 years ago
- ☆22Updated last year
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 8 years ago
- ☆62Updated this week
- ☆48Updated 6 years ago