Programming MapReduce with Scalding
☆82Dec 5, 2015Updated 10 years ago
Alternatives and similar repositories for ProgrammingWithScalding
Users that are interested in ProgrammingWithScalding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cascading and Scalding wrapper for HBase with advanced read features☆55Feb 11, 2020Updated 6 years ago
- Spark Sample Project☆11Dec 15, 2015Updated 10 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30May 2, 2019Updated 7 years ago
- Scripts for making Hadoop deployments in AWS easy☆10Feb 26, 2014Updated 12 years ago
- Command line tools for the parquet project☆44Jul 10, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Mar 14, 2016Updated 10 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Dec 28, 2016Updated 9 years ago
- Contain your coding agents (literally)☆117Apr 18, 2026Updated last month
- An sbt plugin to resolve dependencies using Aether☆13Apr 10, 2025Updated last year
- Open Source Java Textbook☆20Aug 27, 2025Updated 9 months ago
- ☆10Nov 15, 2015Updated 10 years ago
- Random notes on distributed computing and stuff.☆10Jul 12, 2016Updated 9 years ago
- Research and Code for Erlang Factory 2015 Talk☆10May 4, 2016Updated 10 years ago
- Implementation of Paxos☆21Apr 4, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Automatically exported from code.google.com/p/jbirch☆12Sep 6, 2022Updated 3 years ago
- A Scala API for Cascading☆3,522May 28, 2023Updated 3 years ago
- Graph algorithms implemented in GraphX and Spark styles☆15Apr 26, 2015Updated 11 years ago
- Locality-sensitive hashing in PySpark.☆27Mar 11, 2015Updated 11 years ago
- Scalable Machine Learning in Scalding☆360Feb 16, 2018Updated 8 years ago
- Spark Implementation of BIRCH Clustering algorithm☆13Feb 18, 2020Updated 6 years ago
- A Cascading Workflow Visualizer☆83May 9, 2023Updated 3 years ago
- JSON Serde for Hive☆21Oct 13, 2011Updated 14 years ago
- Simple Spark app that reads and writes Avro data☆31Apr 13, 2015Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Terraform module to create an Elastic Kubernetes (EKS) cluster and associated worker instances on AWS☆14Aug 26, 2020Updated 5 years ago
- Some recommendation algorithms and research☆12Sep 16, 2016Updated 9 years ago
- Slides and examples for a presentation on the State datatype provided by Scalaz 7☆26Dec 8, 2016Updated 9 years ago
- ☆17Aug 15, 2015Updated 10 years ago
- Simple UDF to split JSON arrays into Hive arrays☆10Jun 24, 2016Updated 9 years ago
- ☆12Aug 26, 2021Updated 4 years ago
- Simple Audit Mechanism for Rails Applications☆47May 3, 2011Updated 15 years ago
- ☆11Oct 21, 2024Updated last year
- Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka☆20Jan 29, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Efficient solutions to Project Euler (https://projecteuler.net/) problems.☆11Feb 12, 2017Updated 9 years ago
- Cassandra Dataset Manager☆14Sep 1, 2017Updated 8 years ago
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali…☆13Jul 21, 2021Updated 4 years ago
- PyGotham 2017: Spark Streaming for World Domination (and other projects)☆10Oct 5, 2017Updated 8 years ago
- Our workflow, gemified.☆11Dec 14, 2015Updated 10 years ago
- What's an ensemble of leaders? A riak_governor.☆10Apr 13, 2015Updated 11 years ago
- A Scala DSL (API) designed for monitoring event streams, such as for example log files. Based on data parameterized automata and temporal…☆11Dec 21, 2020Updated 5 years ago