Cascading/scalding-tutorial

The Scalding tutorial as a standalone SBT project

☆51

Alternatives and similar repositories for scalding-tutorial

Users who are interested in scalding-tutorial are comparing it to the repositories listed below.

snowplow-archive / scalding-example-project
The Scalding WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR
☆82 · Aug 28, 2014 · Updated 11 years ago
ThinkBigAnalytics / scalding-workshop
A half-day workshop on Scalding, the Scala API for Cascading
☆48 · Mar 21, 2016 · Updated 10 years ago
scalding-io / ProgrammingWithScalding
Programming MapReduce with Scalding
☆82 · Dec 5, 2015 · Updated 10 years ago
tresata / ganitha
scalding powered machine learning
☆109 · Nov 18, 2014 · Updated 11 years ago
Cascading / Impatient
source examples to support the "Cascading for the Impatient" blog post series
☆79 · Aug 30, 2016 · Updated 9 years ago
Cascading / cascading-jdbc
cascading schemes and taps for JDBC
☆27 · Jun 15, 2016 · Updated 10 years ago
ogrodnek / spark-plug
scala driver for launching Amazon EMR jobs
☆40 · Feb 10, 2016 · Updated 10 years ago
ParallelAI / SpyGlass
Cascading and Scalding wrapper for HBase with advanced read features
☆54 · Feb 11, 2020 · Updated 6 years ago
holdenk / fastdataprocessingwithspark-sharkexamples
Examples for Fast Data Processing with Spark example Shark project
☆22 · Jun 11, 2013 · Updated 13 years ago
tuplejump / embedded-kafka
Embedded Kafka for testing and quick prototyping.
☆14 · Apr 19, 2016 · Updated 10 years ago
Cascading / vagrant-cascading-hadoop-cluster
Deploying apache-hadoop in a virtualized cluster as easy as 1-2-3.
☆127 · Jan 16, 2017 · Updated 9 years ago
twitter / scalding
A Scala API for Cascading
☆3,522 · May 28, 2023 · Updated 3 years ago
metamx / scala-util
Scala stuff
☆18 · Jun 13, 2019 · Updated 7 years ago
holdenk / fastdataprocessingwithsparkexamples
Examples for Fast Data Processing with Spark
☆59 · Sep 10, 2013 · Updated 12 years ago
alaiacano / scalding-nb
Naive Bayes classifier written in Scalding
☆27 · Feb 23, 2014 · Updated 12 years ago
NICTA / scoobi
A Scala productivity framework for Hadoop.
☆479 · Jul 1, 2022 · Updated 4 years ago
med-at-scale / high-health
Integrate the GA4GH schemas and probably a scala impl of the service.
☆14 · May 20, 2016 · Updated 10 years ago
echen / scaldingale
Movie recommendations and more in MapReduce and Scalding
☆117 · Feb 11, 2013 · Updated 13 years ago
Cascading / cascading
All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…
☆332 · Nov 29, 2018 · Updated 7 years ago
velvia / cassandra-gdelt
Experiments with the GDELT dataset and Cassandra schemas.
☆25 · Feb 9, 2016 · Updated 10 years ago
ExpediaGroup / plunger
A unit testing framework for the Cascading data processing platform.
☆25 · Aug 25, 2021 · Updated 4 years ago
CamDavidsonPilon / McData
Repo for data surrounding fast food nutrition and ingredients
☆10 · Nov 11, 2018 · Updated 7 years ago
ifesdjeen / cascading-cassandra
Modern Cassandra tap for Cascading. Actually works with Cascading 2.0, Cascalog 1.10 and supports CQL collections.
☆46 · Apr 21, 2015 · Updated 11 years ago
sujitpal / hia-examples
Hadoop In Action Examples
☆40 · Apr 26, 2021 · Updated 5 years ago
twitter-archive / pycascading
A Python wrapper for Cascading
☆220 · Dec 30, 2019 · Updated 6 years ago
Cascading / lingual
Stand-alone ANSI SQL for Cascading on Apache Hadoop
☆48 · Jan 25, 2018 · Updated 8 years ago
etsy / Sahale
A Cascading Workflow Visualizer
☆83 · May 9, 2023 · Updated 3 years ago
echen / sparta
Instantly turn your data into charts and dashboards. It's like a mini Tableau.
☆27 · Jan 19, 2023 · Updated 3 years ago
yods / storm-ml-play
Experiments with VowPal Wabbit Machine Learning & Storm
☆26 · Apr 29, 2013 · Updated 13 years ago
Cascading / SampleRecommender
a simple kind of social recommender
☆32 · Jun 15, 2015 · Updated 11 years ago
Spark-clustering-notebook / coliseum
Project defining the docker image that will support examples of algorithms created in this organization
☆13 · Oct 22, 2017 · Updated 8 years ago
ktoso / hadoop-scalding-nojartool
Hadoop Tool implementation which enables extreme productivity - running MR jobs on your cluster right from your sbt shell!
☆19 · Feb 2, 2014 · Updated 12 years ago
bythebay / pipeline
Complete Pipeline Training at Big Data Scala By the Bay
☆71 · Oct 27, 2015 · Updated 10 years ago
rtrangucci / gps_in_stan
☆12 · Feb 7, 2017 · Updated 9 years ago
amplab / training
Training materials for Strata, AMP Camp, etc
☆150 · Nov 20, 2015 · Updated 10 years ago
ulfelder / democracy-measurement-model
Replication materials for Bayesian measurement error model of dichotomous measures of democracy.
☆16 · May 12, 2015 · Updated 11 years ago
swipely / pipely
Visualize pipeline definitions for AWS Data Pipeline
☆23 · Feb 3, 2026 · Updated 5 months ago
LiveRamp / cascading_ext
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high…
☆58 · Feb 25, 2026 · Updated 4 months ago
ogrisel / my-linux-devbox
Vagrant / Salt configuration with Ubuntu to work on projects related to the scipy stack under Python 3 and Python 2
☆26 · Mar 17, 2014 · Updated 12 years ago