Machine learning and natural language processing with Apache Pig
☆53Dec 17, 2013Updated 12 years ago
Alternatives and similar repositories for varaha
Users that are interested in varaha are comparing it to the libraries listed below
Sorting:
- A grouping of Apache Pig examples.☆65Oct 13, 2020Updated 5 years ago
- Examples of use of pig scripting languages capabilities☆39Aug 1, 2016Updated 9 years ago
- ☆45Feb 16, 2013Updated 13 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆52Jul 19, 2013Updated 12 years ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…☆100Apr 9, 2013Updated 12 years ago
- Python Client for WebHDFS REST API☆43May 8, 2015Updated 10 years ago
- collaborative web tool to enrich content☆12Nov 13, 2011Updated 14 years ago
- Piglet is a DSL for writing Pig scripts in Ruby☆83Jul 21, 2010Updated 15 years ago
- useful JVM classes for the mrjob hadoop streaming framework☆31Jun 20, 2013Updated 12 years ago
- BSON support for PostgreSQL☆29Dec 12, 2013Updated 12 years ago
- A project for code to create models from existing corpora and distribute models.☆42Apr 11, 2012Updated 13 years ago
- A few examples for LMAX disruptor☆17Aug 1, 2011Updated 14 years ago
- Demonstration of how dedupe might be used as geocoder☆17Jun 21, 2022Updated 3 years ago
- Git pre-commits hooks for doing Python code formatting checks☆20Feb 7, 2012Updated 14 years ago
- Notes on Algebra and Recursive Data Types☆10Oct 7, 2011Updated 14 years ago
- Continuous Streaming SQL Queries for Flume☆96Dec 30, 2011Updated 14 years ago
- python wrapper of fast C++ LLE code☆18May 18, 2011Updated 14 years ago
- A JRuby DSL for Cascading☆41Sep 23, 2015Updated 10 years ago
- A Scala library to create RxScala observables and observers from Kafka consumers and producers☆11Dec 26, 2017Updated 8 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆284Apr 25, 2018Updated 7 years ago
- Explore various options of domain modeling with scalaz☆25Jul 21, 2012Updated 13 years ago
- ☆12Apr 21, 2019Updated 6 years ago
- Flexible data workflow glue.☆29Jun 22, 2011Updated 14 years ago
- A sample project to teach myself the Scala language combined with the fun of building a MUD☆34Apr 17, 2012Updated 13 years ago
- ☆12Jul 18, 2016Updated 9 years ago
- Toy single-machine implementation of the Pregel graph-based framework☆119Jan 5, 2017Updated 9 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆50Jul 4, 2011Updated 14 years ago
- Computational biology code samples☆11Feb 28, 2018Updated 8 years ago
- Scala Data access for NoSQL databases☆47Jun 4, 2013Updated 12 years ago
- Machine Learning for Cascading☆84Jun 12, 2015Updated 10 years ago
- off the shelf infrastructure☆25Dec 18, 2023Updated 2 years ago
- Hot Potato is an open source real-time processing framework written in Ruby.☆29Jul 10, 2012Updated 13 years ago
- ☆71Mar 24, 2018Updated 7 years ago
- Apple Music to Discord is a macOS app that syncs your Apple Music listening activity with Discord Rich Presence, showing track info, albu…☆26Feb 20, 2025Updated last year
- Example of an immutable "Invoice" domain object in Scala☆31Feb 14, 2011Updated 15 years ago
- A collection of themes for ZK☆17Jul 21, 2025Updated 8 months ago
- Simple status functionality for Rails models.☆27Dec 23, 2015Updated 10 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit☆47Feb 7, 2012Updated 14 years ago
- Mahout vector encoding for pig☆53Nov 20, 2022Updated 3 years ago