cwensel/cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.

☆355

Alternatives and similar repositories for cascading

Users that are interested in cascading are comparing it to the libraries listed below.

Cascading / cascading
All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…
☆332 · Nov 29, 2018 · Updated 7 years ago
cwensel / cascading.hbase
HBase adapters for Cascading
☆47 · Aug 9, 2009 · Updated 16 years ago
cwensel / cascading.samples
Sample applications using Cascading
☆38 · Oct 11, 2011 · Updated 14 years ago
LiveRamp / cascading_ext
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high…
☆58 · Feb 25, 2026 · Updated 5 months ago
Cascading / cascading-thrift
Serializer and comparator for using Thrift objects in Cascading or Cascalog
☆17 · Dec 31, 2014 · Updated 11 years ago
Cascading / cascading-dbmigrate
Tool to help users migrate large relational databases into Hadoop clusters.
☆67 · Mar 23, 2012 · Updated 14 years ago
cwensel / riffle
Annotations and Classes for managing and executing dependent processes
☆39 · Apr 15, 2021 · Updated 5 years ago
nathanmarz / cascading-batch-query
Optimized joins using bloom filters on Hadoop via Cascading.
☆22 · Sep 25, 2009 · Updated 16 years ago
Cascading / maple
All the Cascading taps you need and love.
☆39 · Mar 4, 2019 · Updated 7 years ago
nathanmarz / cascalog
Data processing on Hadoop without the hassle.
☆1,373 · May 18, 2023 · Updated 3 years ago
twitter / scalding
A Scala API for Cascading
☆3,523 · May 28, 2023 · Updated 3 years ago
nathanmarz / elephantdb-cascalog
Seamless integration of ElephantDB with Cascalog
☆18 · Jan 3, 2012 · Updated 14 years ago
cwensel / cascading.multitool
Cascading.Multitool is a sed and grep command line tool for Apache Hadoop.
☆21 · May 1, 2012 · Updated 14 years ago
cwensel / cascading.jdbc
JDBC adapter for Cascading
☆23 · Jun 27, 2009 · Updated 17 years ago
ParallelAI / SpyGlass
Cascading and Scalding wrapper for HBase with advanced read features
☆54 · Feb 11, 2020 · Updated 6 years ago
nathanmarz / cascalog-workshop
Materials for Cascalog workshop
☆18 · Sep 17, 2011 · Updated 14 years ago
Cascading / lingual
Stand-alone ANSI SQL for Cascading on Apache Hadoop
☆48 · Jan 25, 2018 · Updated 8 years ago
gmarabout-zz / cascading.jruby
A JRuby DSL for Cascading
☆45 · Apr 14, 2011 · Updated 15 years ago
etsy / jading
cascading.jruby build and execution tool
☆16 · Sep 23, 2015 · Updated 10 years ago
gmarabout-zz / cascading.json
Some JSON utility classes for Cascading.
☆21 · Oct 13, 2020 · Updated 5 years ago
nathanmarz / elephantdb
Distributed database specialized in exporting key/value data from Hadoop
☆558 · Jun 27, 2014 · Updated 12 years ago
cwensel / bash-emr
Simple bash functions for manipulating Amazon Elastic MapReduce clusters
☆45 · Jan 5, 2016 · Updated 10 years ago
Cascading / cascading-jdbc
cascading schemes and taps for JDBC
☆27 · Jun 15, 2016 · Updated 10 years ago
twitter / elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134 · Apr 10, 2023 · Updated 3 years ago
sritchie / midje-cascalog
Cascalog functions for Midje.
☆20 · Feb 16, 2013 · Updated 13 years ago
Cascading / Impatient
source examples to support the "Cascading for the Impatient" blog post series
☆79 · Aug 30, 2016 · Updated 9 years ago
jghoman / haivvreo
Hive + Avro. Serde for working with Avro in Hive
☆60 · Dec 16, 2023 · Updated 2 years ago
Cascading / cascading.samples
Sample applications using Cascading
☆20 · Jun 7, 2015 · Updated 11 years ago
square / cascading-helpers
A whole bunch of functions, filters, and other tools that make writing Cascading flows a joy
☆55 · Mar 19, 2023 · Updated 3 years ago
sritchie / jackknife
Useful Clojure utilities!
☆16 · Oct 4, 2015 · Updated 10 years ago
square / cascading2-protobufs
Cascading 2 library for working with Protocol Buffers (Scheme, Serialization, and maybe even some functions/filters)
☆19 · Sep 19, 2024 · Updated last year
Cascading / cascading-hive
Integration for Cascading and Apache Hive
☆25 · Oct 31, 2017 · Updated 8 years ago
mesos / spark
Lightning-fast cluster computing in Java, Scala and Python.
☆1,419 · Apr 8, 2014 · Updated 12 years ago
twitter / hadoop-lzo
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
☆548 · Apr 24, 2024 · Updated 2 years ago
nathanmarz / storm
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
☆8,770 · Aug 16, 2017 · Updated 8 years ago
twitter-archive / pycascading
A Python wrapper for Cascading
☆220 · Dec 30, 2019 · Updated 6 years ago
revelytix / carbonite
Clojure library for serializing Clojure data using Kryo
☆59 · Jan 21, 2012 · Updated 14 years ago
ExpediaGroup / plunger
A unit testing framework for the Cascading data processing platform.
☆25 · Aug 25, 2021 · Updated 4 years ago
twitter / summingbird
Streaming MapReduce with Scalding and Storm
☆2,123 · Jan 19, 2022 · Updated 4 years ago