This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published by Packt.
☆23Apr 9, 2014Updated 12 years ago
Alternatives and similar repositories for pig-design-patterns
Users that are interested in pig-design-patterns are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains the exercises, I did while reading through the Learn Python the hard way book by Zed A. Shaw☆16Sep 23, 2017Updated 8 years ago
- ☆44Jul 24, 2017Updated 8 years ago
- Data ingestion examples☆11Feb 12, 2015Updated 11 years ago
- SQL Windowing Functions for Hadoop☆65Jun 20, 2022Updated 3 years ago
- Data and example code for Programming Pig, by Alan F. Gates☆186Oct 15, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The Device Manager Demo is designed to demonstrate a fully functioning modern Data/IoT application. It is a Lambda architecture built usi…☆13Aug 31, 2017Updated 8 years ago
- Piglet is a DSL for writing Pig scripts in Ruby☆83Jul 21, 2010Updated 15 years ago
- sample oozie workflows☆17Jun 13, 2017Updated 9 years ago
- ☆10Jul 15, 2022Updated 3 years ago
- Choosing a fantasy football team using spark, hive, python, and really just about anything.☆20Feb 13, 2015Updated 11 years ago
- Databricks Azure DevOps Tutorial☆17Apr 8, 2019Updated 7 years ago
- A Ruby/Sinatra web application to browse data on a Chef server☆68Dec 1, 2022Updated 3 years ago
- Fixed-width data source for Spark SQL and DataFrames☆10Oct 25, 2016Updated 9 years ago
- ☆10Oct 16, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Tools for Hadoop☆24Feb 27, 2012Updated 14 years ago
- Domain-specific language to help build and maintain AWS Data Pipelines☆26Aug 22, 2018Updated 7 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Dec 18, 2014Updated 11 years ago
- A Spring-based Test Framework supporting Unit and Integration testing for Spring Boot applications using Spring Data with either Apache G…☆23Nov 29, 2023Updated 2 years ago
- practice fusion contest☆11Dec 10, 2012Updated 13 years ago
- React client app, redux stage management, passport oauth2, paypal rest api and swagger based krakenjs node.js server☆14Nov 17, 2016Updated 9 years ago
- Chrome extension: password generator from master key using PBKDF2 with SHA-256.☆19Sep 14, 2015Updated 10 years ago
- ☆51May 21, 2026Updated 3 weeks ago
- Node.js application - simple notes management using Express, Postgres, Objection.js, Docker, Socket.io, Bluebird Promises☆15Feb 10, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A python script that looks for special lines in a markdown file and uses those lines to convert, clean up, and insert content from URLs i…☆16Dec 9, 2012Updated 13 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- A Javascript library that introduces R idioms and vectorization☆17May 23, 2018Updated 8 years ago
- Code repository for MDX with Microsoft SQL Server 2016 Analysis Services Cookbook by Packt☆12Jan 30, 2023Updated 3 years ago
- ☆195Jun 21, 2022Updated 3 years ago
- Cube: A system for time series visualization.☆24Jan 10, 2013Updated 13 years ago
- Python for Finance: Investment Fundamentals and Data Analytics, published by Packt☆47May 13, 2025Updated last year
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54May 20, 2015Updated 11 years ago
- A grouping of Apache Pig examples.☆65Oct 13, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Intel TBB Package for R/Rcpp☆15Jul 7, 2014Updated 11 years ago
- 开源关系型数据库PostgreSQL生态系列资源^_^☆26Apr 15, 2020Updated 6 years ago
- Some dockerfiles for deep learning☆19Feb 19, 2018Updated 8 years ago
- Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆20Nov 23, 2023Updated 2 years ago
- The JSON API Browser☆40Dec 5, 2013Updated 12 years ago
- ☆41Jun 7, 2012Updated 14 years ago
- Code repository for Fast Data Processing Systems with SMACK Stack by Packt☆18Jan 18, 2023Updated 3 years ago