spotify / crunch-lib
Useful reusable pipeline components for Crunch jobs
☆27Updated 10 years ago
Alternatives and similar repositories for crunch-lib
Users who are interested in crunch-lib are comparing it to the libraries listed below.
- An Apache Storm IMetricsConsumer that forwards Storm's built-in metrics to a Graphite server for real-time graphing, visualization, and o…☆76Updated 2 years ago
- Hadoop MapReduce job to bulk load data into Cassandra☆75Updated 3 years ago
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.☆154Updated 3 years ago
- Delimited file loader for Cassandra☆199Updated 6 years ago
- metrics-datadog☆186Updated 2 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format☆127Updated 4 years ago
- kafka-connect-s3: Ingest data from Kafka to object stores (S3)☆95Updated 6 years ago
- Simple JVM Profiler Using StatsD and Other Metrics Backends☆334Updated 3 weeks ago
- Tools for parsing, creating and doing other fun stuff with sstables☆163Updated 8 years ago
- Stubbed Cassandra☆87Updated 6 years ago
- Metrics produced to Kafka and consumers for monitoring them☆102Updated 11 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆330Updated 3 years ago
- Production heap profiling for the JVM, compatible with google-perftools.☆396Updated 9 years ago
- Hadoop output committers for S3☆113Updated 5 years ago
- Cassandra schema migration tool for Java☆99Updated 3 years ago
- Statsd reporter for codahale/metrics.☆94Updated 7 years ago
- Low level integration of Spark and Kafka☆130Updated 7 years ago
- Coral is a real-time analytics and data science platform. It transforms streaming events and extracts patterns from data via RESTful APIs.…☆148Updated 6 years ago
- Tools for working with parquet, impala, and hive☆134Updated 5 years ago
- Documentation tool for Avro schemas☆150Updated 6 years ago
- Reactive Kafka client☆161Updated 5 years ago
- A Java library for stored queries☆378Updated 2 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores☆465Updated 5 years ago
- A high precision Java CMS optimizer☆271Updated 7 years ago
- Programming MapReduce with Scalding☆82Updated 10 years ago
- Database migration (evolution) tool for Apache Cassandra☆109Updated 4 months ago
- A reasonably complete implementation of the Universal Scalability Law model.☆202Updated 6 years ago