ScalaCheck for Spark
☆63Apr 2, 2018Updated 7 years ago
Alternatives and similar repositories for sscheck
Users that are interested in sscheck are comparing it to the libraries listed below
Sorting:
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆108Feb 1, 2018Updated 8 years ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 2 months ago
- Spark implementation of Slowly Changing Dimension type 2☆11Jan 8, 2019Updated 7 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- Writing application logic for Spark jobs that can be unit-tested without a SparkContext☆76Jan 27, 2019Updated 7 years ago
- A set of scripts useful for managing Unix/Linux Oracle database environments.☆11Dec 13, 2022Updated 3 years ago
- An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files☆22Mar 8, 2018Updated 7 years ago
- ☆10Jun 9, 2019Updated 6 years ago
- OSGi and Jigsaw Working Together☆10Mar 6, 2017Updated 8 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti☆16Sep 28, 2020Updated 5 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- A generic data pipeline which will map Elasticsearch documents to Bigquery table rows☆14Sep 29, 2019Updated 6 years ago
- basic postgres load generator☆11Sep 28, 2018Updated 7 years ago
- Getting started with Spark, Spark streaming, Spark SQL and DataFrame.☆48May 15, 2018Updated 7 years ago
- ZIO integration with AWS S3 SDK☆17Sep 24, 2020Updated 5 years ago
- Scripts complement the Optimizing a Data Vault data warehouse on the Snowflake Cloud Data Platform webinar☆16Oct 8, 2020Updated 5 years ago
- Snowflake scripts and useful snippets☆15Feb 2, 2025Updated last year
- Auto-fixing error due to version upgrade, good practice etc.☆11Sep 5, 2020Updated 5 years ago
- ☆13Nov 20, 2016Updated 9 years ago
- Run TPC-DS against different databases including Hive, Spark SQL and IBM BigSQL☆14Jan 4, 2022Updated 4 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16May 11, 2019Updated 6 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Expressive types for Spark.☆895Updated this week
- Cloudera Manager datasource for Grafana 3.x☆19Jun 28, 2023Updated 2 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆29Jan 6, 2017Updated 9 years ago
- Benchmarking read performance of PostgreSQL and MongoDB on same data sets.☆16Aug 14, 2018Updated 7 years ago
- ☆13Jul 29, 2024Updated last year
- A compact framework for automating a Snowflake analytics pipeline on Amazon ECS.☆18Apr 4, 2023Updated 2 years ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- Scala Logging Library☆37Sep 21, 2017Updated 8 years ago
- ☆17Jul 27, 2015Updated 10 years ago
- PeopleSoft: Configuration and Metrics Utility☆17Oct 6, 2022Updated 3 years ago
- RESTful APIs using Node.js, Express and Oracle (NEO)☆22May 28, 2017Updated 8 years ago
- BigQuery Schema Conversion Tool☆23Oct 6, 2020Updated 5 years ago
- Scala DSL for Unit-Testing Processing Topologies in Kafka Streams☆186Jan 16, 2021Updated 5 years ago
- Use it when you need to send and receive JSON via ActiveMQ in your Dropwizard application☆32Feb 4, 2024Updated 2 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆55May 20, 2015Updated 10 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago