Useful reusable pipeline components for Crunch jobs
☆27 · Feb 10, 2015 · Updated 11 years ago
Alternatives and similar repositories for crunch-lib
Users that are interested in crunch-lib are comparing it to the libraries listed below.
- Provides a simple archetype to create MapReduce jobs with Maven. ☆24 · Dec 3, 2010 · Updated 15 years ago
- Mirror of Apache Crunch (Incubating) ☆110 · Feb 2, 2021 · Updated 5 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface. ☆101 · Oct 31, 2017 · Updated 8 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin… ☆24 · Jan 25, 2016 · Updated 10 years ago
- Code to index HDFS to Solr using MapReduce ☆51 · Nov 27, 2018 · Updated 7 years ago
- ☆20 · Apr 4, 2022 · Updated 4 years ago
- Generation of arbitrary case classes / ADTs instances with Scalaprops and Magnolia ☆14 · Apr 7, 2026 · Updated last week
- jq for Java ☆15 · Jun 20, 2016 · Updated 9 years ago
- Apache Pig plugin for Eclipse ☆12 · Feb 28, 2017 · Updated 9 years ago
- Opt - AnyVal Option-like type ☆10 · Jan 7, 2017 · Updated 9 years ago
- Approximate cardinality estimation with HyperLogLog, as a Hive function ☆42 · Dec 17, 2012 · Updated 13 years ago
- Crazy Simple Logging for Scala ☆23 · Jul 29, 2015 · Updated 10 years ago
- A protobuf plugin to generate parquet schemas. ☆13 · Mar 9, 2022 · Updated 4 years ago
- A Java package for the LDA and DMM topic models ☆85 · Apr 17, 2019 · Updated 6 years ago
- Addon bundle for Dropwizard to support logging to a GELF-enabled server like Graylog or logstash ☆52 · Oct 13, 2022 · Updated 3 years ago
- An example SBT project which uses new-style macros ☆13 · Jul 29, 2017 · Updated 8 years ago
- Code examples from the book "Scala in Action" ☆119 · Feb 9, 2022 · Updated 4 years ago
- Objective-C comet client using the Bayeux protocol ☆29 · Sep 28, 2011 · Updated 14 years ago
- Spatial join, written in Java. ☆17 · Oct 13, 2020 · Updated 5 years ago
- Simple JVM Profiler Using StatsD and Other Metrics Backends ☆335 · Jan 15, 2026 · Updated 2 months ago
- Cats instances for fastparse ☆18 · May 6, 2018 · Updated 7 years ago
- A git subcommand to apply skeleton repository continuously ☆15 · Apr 2, 2026 · Updated last week
- ☆13 · Oct 4, 2022 · Updated 3 years ago
- Hugecast - The Off-Heap Storage for Hazelcast ☆22 · Dec 1, 2013 · Updated 12 years ago
- JSON parser library for Nim based on simdjson bindings ☆17 · Jul 4, 2024 · Updated last year
- Flyte Flink k8s plugin. ☆20 · Jan 29, 2025 · Updated last year
- Replaced by http://beaucatcher.org/, ignore this repo ☆15 · Jul 4, 2011 · Updated 14 years ago
- The `netcat` container, the Swiss Army knife of networking tools, Dockerized! ☆10 · Jul 4, 2018 · Updated 7 years ago
- Fast, zero-copy HTML Parser written in Rust ☆27 · Dec 6, 2025 · Updated 4 months ago
- Scala/Akka XMPP client library ☆31 · Nov 27, 2013 · Updated 12 years ago
- Easy way to send Finagle metrics to Codahale Metrics library ☆42 · Apr 2, 2020 · Updated 6 years ago
- Corrects coefficient estimates from OLS regression when data matrix contains known measurement error ☆11 · May 17, 2021 · Updated 4 years ago
- Very fast CBOR for Nim ☆26 · Nov 20, 2025 · Updated 4 months ago
- JVMTI agent and JavaFX analyzer to gather JVM runtime information for after-the-fact analysis. ☆131 · Jul 19, 2020 · Updated 5 years ago
- A React DataGrid written in TypeScript, by a team with 20+ years of experience building data grids ☆18 · Sep 6, 2023 · Updated 2 years ago
- Java port of Scalactic ☆10 · Oct 13, 2020 · Updated 5 years ago
- A minimal app launcher for Wayland compositors ☆18 · Dec 4, 2024 · Updated last year
- Source code to accompany the book "Hadoop in Practice", published by Manning. ☆204 · Feb 11, 2020 · Updated 6 years ago
- Sample code, data, and configuration for the book ☆189 · May 12, 2021 · Updated 4 years ago