Cascading/cascading

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cascading/cascading)

Cascading / cascading

All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on various cluster computing platforms.

☆332

Alternatives and similar repositories for cascading

Users that are interested in cascading are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cwensel / cascading
View on GitHub
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
☆355Apr 8, 2025Updated last year
Cascading / lingual
View on GitHub
Stand-alone ANSI SQL for Cascading on Apache Hadoop
☆48Jan 25, 2018Updated 8 years ago
Cascading / Impatient
View on GitHub
source examples to support the "Cascading for the Impatient" blog post series
☆79Aug 30, 2016Updated 9 years ago
twitter / scalding
View on GitHub
A Scala API for Cascading
☆3,523May 28, 2023Updated 3 years ago
twitter-archive / pycascading
View on GitHub
A Python wrapper for Cascading
☆220Dec 30, 2019Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ParallelAI / SpyGlass
View on GitHub
Cascading and Scalding wrapper for HBase with advanced read features
☆54Feb 11, 2020Updated 6 years ago
Cascading / pattern
View on GitHub
Machine Learning for Cascading
☆85Jun 12, 2015Updated 11 years ago
Cascading / cascading.samples
View on GitHub
Sample applications using Cascading
☆20Jun 7, 2015Updated 11 years ago
LiveRamp / cascading_ext
View on GitHub
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high…
☆58Feb 25, 2026Updated 5 months ago
twitter / elephant-bird
View on GitHub
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134Apr 10, 2023Updated 3 years ago
nathanmarz / cascalog
View on GitHub
Data processing on Hadoop without the hassle.
☆1,373May 18, 2023Updated 3 years ago
Cascading / maple
View on GitHub
All the Cascading taps you need and love.
☆39Mar 4, 2019Updated 7 years ago
cwensel / cascading.hbase
View on GitHub
HBase adapters for Cascading
☆47Aug 9, 2009Updated 16 years ago
Cascading / scalding-tutorial
View on GitHub
The Scalding tutorial as a standalone SBT project
☆51Oct 16, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Cascading / fluid
View on GitHub
A Fluent Java API for Cascading
☆22Jun 14, 2017Updated 9 years ago
Cascading / SampleRecommender
View on GitHub
a simple kind of social recommender
☆32Jun 15, 2015Updated 11 years ago
twitter / summingbird
View on GitHub
Streaming MapReduce with Scalding and Storm
☆2,123Jan 19, 2022Updated 4 years ago
nathanmarz / elephantdb
View on GitHub
Distributed database specialized in exporting key/value data from Hadoop
☆558Jun 27, 2014Updated 12 years ago
dataArtisans / cascading-flink
View on GitHub
Cascading on Apache Flink®
☆54Feb 5, 2024Updated 2 years ago
nathanmarz / dfs-datastores
View on GitHub
Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.
☆215Jun 29, 2016Updated 10 years ago
YahooArchive / howl
View on GitHub
Common metadata layer for Hadoop's Map Reduce, Pig, and Hive
☆77Feb 17, 2011Updated 15 years ago
twitter / storehaus
View on GitHub
Storehaus is a library that makes it easy to work with asynchronous key value stores
☆465Jul 17, 2020Updated 6 years ago
julianhyde / optiq
View on GitHub
Obsolete - superseded by Apache Calcite
☆237Jan 20, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mesos / spark
View on GitHub
Lightning-fast cluster computing in Java, Scala and Python.
☆1,419Apr 8, 2014Updated 12 years ago
cwensel / riffle
View on GitHub
Annotations and Classes for managing and executing dependent processes
☆39Apr 15, 2021Updated 5 years ago
gmarabout-zz / cascading.json
View on GitHub
Some JSON utility classes for Cascading.
☆21Oct 13, 2020Updated 5 years ago
echen / scaldingale
View on GitHub
Movie recommendations and more in MapReduce and Scalding
☆117Feb 11, 2013Updated 13 years ago
nathanmarz / storm
View on GitHub
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
☆8,770Aug 16, 2017Updated 8 years ago
julienledem / brennus
View on GitHub
Builder pattern to generate java classes
☆17Aug 12, 2021Updated 4 years ago
ThinkBigAnalytics / scalding-workshop
View on GitHub
A half-day workshop on Scalding, the Scala API for Cascading
☆48Mar 21, 2016Updated 10 years ago
apache / giraph
View on GitHub
Mirror of Apache Giraph
☆620Apr 14, 2023Updated 3 years ago
twitter / hraven
View on GitHub
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
☆129Jan 14, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
square / cascading2-protobufs
View on GitHub
Cascading 2 library for working with Protocol Buffers (Scheme, Serialization, and maybe even some functions/filters)
☆19Sep 19, 2024Updated last year
addthis / meshy
View on GitHub
☆17Jan 7, 2020Updated 6 years ago
apache / pig
View on GitHub
Mirror of Apache Pig
☆687May 15, 2026Updated 2 months ago
cwensel / cascading.samples
View on GitHub
Sample applications using Cascading
☆38Oct 11, 2011Updated 14 years ago
square / cascading-helpers
View on GitHub
A whole bunch of functions, filters, and other tools that make writing Cascading flows a joy
☆55Mar 19, 2023Updated 3 years ago
bixo / bixo
View on GitHub
Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…
☆143Jul 7, 2022Updated 4 years ago
mudphone / hbase-runner
View on GitHub
Dear StrongBad, this is a simple utility library for working with HBase in the REPL.
☆23Jul 14, 2010Updated 16 years ago