hammerlab / spark-utilLinks
low-level helpers for Apache Spark libraries and tests
☆16Updated 6 years ago
Alternatives and similar repositories for spark-util
Users that are interested in spark-util are comparing it to the libraries listed below
Sorting:
- Deriving Spark DataFrame schemas from case classes☆44Updated last year
- Utilities for writing tests that use Apache Spark.☆24Updated 6 years ago
- Writing application logic for Spark jobs that can be unit-tested without a SparkContext☆77Updated 6 years ago
- ☆45Updated 5 years ago
- A collection of Apache Parquet add-on modules☆30Updated 3 weeks ago
- something to help you spark☆65Updated 6 years ago
- Impatient fork of Ammonite☆62Updated 7 years ago
- Argument parsing in Scala☆83Updated 2 years ago
- A quotation-based Scala DSL for scalable data analysis.☆63Updated 3 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 9 years ago
- Framian☆115Updated 7 years ago
- Simple SBT plugin to configure Spark applications☆24Updated last year
- An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files☆22Updated 7 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆28Updated 8 years ago
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes.☆115Updated 4 years ago
- Translates xml -> awesome. Maven-ish support for sbt.☆76Updated 4 months ago
- Big Data Toolkit for the JVM☆146Updated 4 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Updated 6 years ago
- Lightweight, functional and correct time-series library for scala. Easy manipulation, filtering and combination of time-series data.☆30Updated 3 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆112Updated 5 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29Updated 5 years ago
- Dynamically defines and loads Scala classes at runtime. Useful for turning JSON schemas into Scala case classes on the fly.☆44Updated 9 years ago
- Single node, in-memory DataFrame analytics library.☆41Updated 9 months ago
- Shapeless utilities for common data types☆67Updated last week
- Activator template for Reactive Kafka☆20Updated 8 years ago
- Scripts for parsing / making sense of yarn logs☆52Updated 8 years ago
- ScalaCheck for Spark☆63Updated 7 years ago
- ☆29Updated 10 years ago
- Discover java object sizes through questionable sleuthing plus luck.☆67Updated 7 years ago
- front end to view akka cluster topography☆48Updated 8 years ago