apache/crunch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/crunch)

apache / crunch

Mirror of Apache Crunch (Incubating)

☆110

Alternatives and similar repositories for crunch

Users that are interested in crunch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

spotify / crunch-lib
View on GitHub
Useful reusable pipeline components for Crunch jobs
☆27Feb 10, 2015Updated 11 years ago
larsgeorge / maven-archetype-hadoop
View on GitHub
Provides a simple archetype to create MapReduce jobs with Maven.
☆24Dec 3, 2010Updated 15 years ago
jwills / target-duckdb
View on GitHub
A Singer.io target for DuckDB
☆19Feb 11, 2026Updated 5 months ago
apache / pig
View on GitHub
Mirror of Apache Pig
☆687May 15, 2026Updated 2 months ago
apache / apex-core
View on GitHub
Mirror of Apache Apex core
☆350Jun 7, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
malike / elasticsearch-kafka-watch
View on GitHub
A custom watcher plugin for Elasticsearch that feeds Apache Kafka
☆11Mar 9, 2018Updated 8 years ago
kasun04 / grpc-microservices
View on GitHub
☆12Nov 3, 2018Updated 7 years ago
leigu / brave-tracer-example
View on GitHub
☆13Oct 28, 2015Updated 10 years ago
tomslabs / avro-utils
View on GitHub
Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming
☆26Sep 10, 2013Updated 12 years ago
apache / tajo
View on GitHub
Mirror of Apache Tajo
☆136May 11, 2020Updated 6 years ago
kite-sdk / kite
View on GitHub
Kite SDK
☆393Nov 1, 2022Updated 3 years ago
amcjen / node-udt
View on GitHub
A compiled (read: fast!) node.js library for the UDT4 high-speed data transfer protocol
☆24Aug 27, 2011Updated 14 years ago
tdoehmen / duckdq
View on GitHub
☆35Jul 23, 2023Updated 2 years ago
apache / airavata
View on GitHub
A general purpose Distributed Systems Framework
☆153Jul 10, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
bj4096 / 1075work
View on GitHub
☆14Mar 29, 2019Updated 7 years ago
vinicius0026 / hapi-vue-ssr
View on GitHub
Vue.js app server-side rendered with Hapi.js
☆12Dec 6, 2022Updated 3 years ago
PacktPublishing / Java-Data-Science-Cookbook
View on GitHub
Code repository for Java Data Science Cookbook, published by Packt
☆25Jan 30, 2023Updated 3 years ago
wayfair-archive / redux-ledger
View on GitHub
Async Redux Testing Middleware
☆11Mar 15, 2022Updated 4 years ago
felixmc / custom-logger
View on GitHub
simple, customizable logger for node
☆11Jan 5, 2016Updated 10 years ago
awnuxkjy / naive-bayesian
View on GitHub
naive bayesian java demo
☆10Aug 30, 2013Updated 12 years ago
apache / trafodion
View on GitHub
Apache Trafodion
☆246Jun 7, 2021Updated 5 years ago
giltene / HeapFragger
View on GitHub
HeapFragger: A heap fragmentation inducer
☆61Sep 19, 2021Updated 4 years ago
mbonaci / mbo-spark
View on GitHub
Spark exploration
☆19Apr 9, 2015Updated 11 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
rei-m / android_hyakuninisshu
View on GitHub
app for 百人一首
☆11Jul 25, 2025Updated 11 months ago
apache / commons-weaver
View on GitHub
Apache Commons Weaver
☆27Jul 13, 2026Updated last week
Kayrnt / duckdb_mysql_scanner
View on GitHub
DuckDB extension for MySQL
☆15Mar 17, 2024Updated 2 years ago
milinda / samza-sql
View on GitHub
SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka
☆30Jun 8, 2016Updated 10 years ago
OlegIlyenko / hacking-scala-blog
View on GitHub
Blog posts from hacking-scala.tumblr.com
☆20Jun 20, 2016Updated 10 years ago
liaco / mimir
View on GitHub
☆16Jul 25, 2025Updated 11 months ago
jetty-project / embedded-servlet-3.1
View on GitHub
Example of Embedded Jetty with Servlet 3.1
☆15Aug 10, 2021Updated 4 years ago
vladislav-karamfilov / TelerikAcademy
View on GitHub
This repository stores all projects I've done as Telerik Software Academy student
☆13Dec 28, 2014Updated 11 years ago
apache / chukwa
View on GitHub
Mirror of Apache Chukwa
☆85Mar 31, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
initLab / fauna
View on GitHub
Small hackerspace management automation system in use by init Lab
☆15Updated this week
ept / avrodoc
View on GitHub
Documentation tool for Avro schemas
☆150Nov 15, 2019Updated 6 years ago
spotify / code-of-conduct
View on GitHub
Spotify FOSS Community Code of Conduct
☆26Oct 13, 2022Updated 3 years ago
sebastian-r-schmidt / logicaldecoding
View on GitHub
Parses PostgreSQL Logical Decoding output
☆17Aug 31, 2016Updated 9 years ago
debasishg / scala-snippets
View on GitHub
Various Scala snippets of interest - some of them plagiarised
☆28Apr 1, 2014Updated 12 years ago
Storm-Applied / C2-Github-commit-count
View on GitHub
Github commit count topology
☆15Jan 31, 2015Updated 11 years ago
apache / incubator-sentry
View on GitHub
Mirror of Apache Sentry
☆35Oct 18, 2019Updated 6 years ago