ExpediaGroup / corc
An ORC File Scheme for the Cascading data processing platform.
☆14Updated 3 years ago
Alternatives and similar repositories for corc:
Users that are interested in corc are comparing it to the libraries listed below
- A unit testing framework for the Cascading data processing platform.☆25Updated 3 years ago
- Collection of utilities to allow writing java code that operates across a wide range of avro versions.☆77Updated this week
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated 10 months ago
- Measure behavior of Java applications☆43Updated 3 years ago
- High performance native memory access for Java.☆123Updated this week
- A library to expose more of Apache Spark's metrics system☆146Updated 5 years ago
- Hadoop output committers for S3☆108Updated 4 years ago
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.☆155Updated 2 years ago
- ☆13Updated 6 years ago
- A user friendly API for checking for and reporting on Avro schema incompatibilities.☆59Updated 10 months ago
- Probabilistic data structures for Guava.☆54Updated 4 years ago
- Fast Apache Avro serialization/deserialization library☆43Updated 4 years ago
- Cascading on Apache Flink®☆54Updated 11 months ago
- Large off-heap arrays and mmap files for Scala and Java☆401Updated 2 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.☆126Updated 6 years ago
- Benchmark suite for data compression library on the JVM☆217Updated 7 months ago
- JUnit rule for spinning up a Kafka broker☆104Updated 2 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆87Updated 10 months ago
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.☆116Updated 7 months ago
- ☆76Updated 8 years ago
- Utilities for processing Flink checkpoints/savepoints☆74Updated 5 years ago
- Big Data Toolkit for the JVM☆145Updated 4 years ago
- Tools to work with off-heap memory using sun.misc.Unsafe☆136Updated 7 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Collection of advanced monitoring structures with rolling time window semantic for Dropwizard-Metrics library, including integration with…☆102Updated 2 years ago
- Mirror of Apache Gearpump (Incubating)☆297Updated 6 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆12Updated last year
- XPath likeness for Avro☆35Updated last year
- JSON decoder for AVRO that infers default values☆28Updated 4 years ago