Cascading/pattern

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cascading/pattern)

Cascading / pattern

Machine Learning for Cascading

☆85

Alternatives and similar repositories for pattern

Users that are interested in pattern are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cascading / lingual
View on GitHub
Stand-alone ANSI SQL for Cascading on Apache Hadoop
☆48Jan 25, 2018Updated 8 years ago
LiveRamp / cascading_ext
View on GitHub
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high…
☆58Feb 25, 2026Updated 4 months ago
ParallelAI / SpyGlass
View on GitHub
Cascading and Scalding wrapper for HBase with advanced read features
☆54Feb 11, 2020Updated 6 years ago
rbrush / clara-storm
View on GitHub
Forward-chaining rules over Storm
☆29Nov 26, 2014Updated 11 years ago
alienrobotwizard / varaha
View on GitHub
Machine learning and natural language processing with Apache Pig
☆53Dec 17, 2013Updated 12 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
clojurewerkz / romulan
View on GitHub
LMAX Disruptor in Clojure embrace
☆15May 18, 2013Updated 13 years ago
twitter / elephant-bird
View on GitHub
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134Apr 10, 2023Updated 3 years ago
Cascading / cascading
View on GitHub
All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…
☆332Nov 29, 2018Updated 7 years ago
Cascading / scalding-tutorial
View on GitHub
The Scalding tutorial as a standalone SBT project
☆51Oct 16, 2017Updated 8 years ago
ccsevers / scalding-linalg
View on GitHub
Linear algebra routines for Scalding.
☆21May 23, 2013Updated 13 years ago
josephxsxn / moya
View on GitHub
Memcached on YARN
☆19Jun 2, 2014Updated 12 years ago
ndimiduk / lein-hadoop
View on GitHub
leiningen plugin for generating hadoop-compatible jars
☆28Jan 30, 2012Updated 14 years ago
yods / storm-ml-play
View on GitHub
Experiments with VowPal Wabbit Machine Learning & Storm
☆26Apr 29, 2013Updated 13 years ago
davidandrzej / chisel
View on GitHub
Clojure wrapper for LDA topic modeling in MALLET
☆33Sep 6, 2011Updated 14 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Cascading / Impatient
View on GitHub
source examples to support the "Cascading for the Impatient" blog post series
☆79Aug 30, 2016Updated 9 years ago
intel / graphbuilder
View on GitHub
The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop.
☆101Oct 9, 2014Updated 11 years ago
twitter / summingbird
View on GitHub
Streaming MapReduce with Scalding and Storm
☆2,123Jan 19, 2022Updated 4 years ago
shivaram / spark-ec2
View on GitHub
Scripts used to setup a Spark cluster on EC2
☆21Mar 24, 2016Updated 10 years ago
alexott / clojure-hadoop
View on GitHub
Library to aid writing Hadoop jobs in Clojure.
☆98Nov 21, 2013Updated 12 years ago
tellapart / TellApart-Hadoop-Utils
View on GitHub
Utilities for working with Hadoop and Cascading
☆19Feb 8, 2011Updated 15 years ago
BertrandDechoux / cascading.learn
View on GitHub
Test driven learning of Cascading.
☆40Feb 11, 2020Updated 6 years ago
TheClimateCorporation / lemur
View on GitHub
Lemur is a tool to launch hadoop jobs locally or on EMR, based on a configuration file, referred to as a jobdef. The jobdef file describe…
☆84Oct 16, 2017Updated 8 years ago
damballa / parkour
View on GitHub
Hadoop MapReduce in idiomatic Clojure.
☆255Mar 23, 2016Updated 10 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
infochimps-labs / data_science_fun_pack
View on GitHub
Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.
☆30Jun 29, 2014Updated 12 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
quintona / storm-pattern
View on GitHub
A fork of cascading patterns, but implemented for trident
☆71Dec 16, 2023Updated 2 years ago
amplab / MLI
View on GitHub
An API for Distributed Machine Learning
☆156Sep 22, 2016Updated 9 years ago
mengxr / spark-als
View on GitHub
Another, hopefully better, implementation of ALS on Spark
☆14May 20, 2015Updated 11 years ago
roman / river
View on GitHub
A monadic stream library in Clojure (port of Haskell's enumerator).
☆18Mar 17, 2012Updated 14 years ago
technomancy / javert
View on GitHub
inspector
☆24May 15, 2013Updated 13 years ago
Cascading / vagrant-cascading-hadoop-cluster
View on GitHub
Deploying apache-hadoop in a virtualized cluster as easy as 1-2-3.
☆127Jan 16, 2017Updated 9 years ago
yahoo / storm-yarn
View on GitHub
Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.
☆418Jul 21, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Cascading / fluid
View on GitHub
A Fluent Java API for Cascading
☆22Jun 14, 2017Updated 9 years ago
utcompling / Scalabha
View on GitHub
Scala utilities for teaching computational linguistics and prototyping algorithms.
☆43Dec 29, 2012Updated 13 years ago
cwensel / cascading.multitool
View on GitHub
Cascading.Multitool is a sed and grep command line tool for Apache Hadoop.
☆21May 1, 2012Updated 14 years ago
twitter-archive / pycascading
View on GitHub
A Python wrapper for Cascading
☆220Dec 30, 2019Updated 6 years ago
etsy / Sahale
View on GitHub
A Cascading Workflow Visualizer
☆83May 9, 2023Updated 3 years ago
datasalt / splout-db
View on GitHub
A web-latency SQL spout for Hadoop.
☆51Jan 25, 2021Updated 5 years ago
nathanmarz / dfs-datastores
View on GitHub
Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.
☆215Jun 29, 2016Updated 10 years ago