nevillelyh / parquet-extra
A collection of Apache Parquet add-on modules
☆30 Updated last month
Alternatives and similar repositories for parquet-extra
Users who are interested in parquet-extra are comparing it to the libraries listed below.
- Compile-time tools for working with Avros in Scala ☆55 Updated 7 years ago
- Fast, memory-efficient, minimal-serialization, binary data vectors for Scala and other languages ☆67 Updated 7 years ago
- An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files ☆22 Updated 7 years ago
- Generic protobuf manipulation ☆36 Updated last month
- An embedded job scheduler. ☆116 Updated last year
- A quotation-based Scala DSL for scalable data analysis. ☆63 Updated 3 years ago
- Writing application logic for Spark jobs that can be unit-tested without a SparkContext ☆77 Updated 6 years ago
- ScalaCheck for Spark ☆63 Updated 7 years ago
- Thyme is a microbenchmark utility for Scala. It includes Parsley, a (simple) local profiling tool. ☆168 Updated 8 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark. ☆29 Updated 8 years ago
- Scio IDEA plugin ☆30 Updated last week
- Large-scale event processing with Akka Persistence and Apache Spark ☆273 Updated 9 years ago
- Fluent Scala DSL for Google's Cloud Dataflow SDK ☆56 Updated 10 years ago
- Big Data Toolkit for the JVM ☆145 Updated 4 years ago
- Framian ☆115 Updated 7 years ago
- Deriving Spark DataFrame schemas from case classes ☆44 Updated last year
- Discover java object sizes through questionable sleuthing plus luck. ☆68 Updated 7 years ago
- something to help you spark ☆64 Updated 6 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster. ☆47 Updated 9 years ago
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes. ☆115 Updated 4 years ago
- Translates xml -> awesome. Maven-ish support for sbt. ☆76 Updated this week
- Use Cascading Taps and Scalding DSL with Spark ☆49 Updated 8 years ago
- low-level helpers for Apache Spark libraries and tests ☆16 Updated 6 years ago
- Shapeless utilities for common data types ☆67 Updated last month
- Bucketing and partitioning system for Parquet ☆30 Updated 7 years ago
- ☆45 Updated 5 years ago
- SBT Plugins for AI2 projects ☆24 Updated 2 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 Updated 2 years ago
- Builds models from JSON Schemas ☆103 Updated 5 years ago
- Dynamically defines and loads Scala classes at runtime. Useful for turning JSON schemas into Scala case classes on the fly. ☆44 Updated 10 years ago