Spark functions to run popular phonetic and string matching algorithms
☆59 · Feb 22, 2022 · Updated 4 years ago
Alternatives and similar repositories for spark-stringmetric
Users who are interested in spark-stringmetric compare it to the libraries listed below
- low-level helpers for Apache Spark libraries and tests · ☆16 · Dec 29, 2018 · Updated 7 years ago
- PySpark phonetic and string matching algorithms · ☆41 · Feb 19, 2024 · Updated 2 years ago
- Filling in the Spark function gaps across APIs · ☆50 · Apr 14, 2021 · Updated 4 years ago
- Test suite to document the behavior of Spark · ☆21 · Apr 15, 2021 · Updated 4 years ago
- Spark data profiling utilities · ☆23 · Nov 24, 2018 · Updated 7 years ago
- ☆12 · Nov 2, 2024 · Updated last year
- Quartz Extension and utilities for cron-style scheduling in Apache Pekko · ☆12 · Dec 25, 2025 · Updated 2 months ago
- Poison pills and Kafka Streams demo · ☆10 · Jul 25, 2020 · Updated 5 years ago
- Write property based tests easily on spark dataframes · ☆20 · Jan 19, 2024 · Updated 2 years ago
- Speak Slack notifications and process Slack slash commands · ☆15 · Dec 20, 2018 · Updated 7 years ago
- A low-dependency HTTP health check server for Scala · ☆13 · Feb 26, 2026 · Updated last week
- A tool to validate data, built around Apache Spark. · ☆101 · Feb 19, 2026 · Updated 2 weeks ago
- Used to generate mock Avro data · ☆15 · Jun 23, 2018 · Updated 7 years ago
- This project is about numbers: exact (1, e, π, 𝛙, √2, etc.), fuzzy e.g., 1836.152673426(32), or lazy e.g., cos(2π), as quantities (with … · ☆15 · Feb 23, 2026 · Updated last week
- Spark style guide · ☆271 · Sep 30, 2024 · Updated last year
- Cross Platform Scala 2d graphics (but 3d compatible), basic geometry, maps, Earth maps, hex-tiling and strategy library(s). · ☆23 · Feb 27, 2026 · Updated last week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit) · ☆454 · Feb 8, 2026 · Updated 3 weeks ago
- Essential Spark extensions and helper methods ✨😲 · ☆766 · Sep 14, 2025 · Updated 5 months ago
- Expressive types for Spark. · ☆895 · Updated this week
- Support for JDK9's Multi Release JAR Files (JEP 238) · ☆17 · Sep 5, 2024 · Updated last year
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. · ☆73 · Mar 14, 2021 · Updated 4 years ago
- A docker image with a pre-configured Hive Metastore and a Spark ThriftServer · ☆19 · Jan 20, 2020 · Updated 6 years ago
- Apache (Py)Spark type annotations (stub files). · ☆118 · Aug 17, 2022 · Updated 3 years ago
- A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dic… · ☆82 · Apr 25, 2022 · Updated 3 years ago
- A core AST and utilities to manipulate geographical data · ☆22 · Sep 30, 2022 · Updated 3 years ago
- type-class based data cleansing library for Apache Spark SQL · ☆78 · Jun 23, 2019 · Updated 6 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive · ☆187 · Oct 15, 2025 · Updated 4 months ago
- This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server · ☆50 · Jul 16, 2023 · Updated 2 years ago
- ☆21 · Feb 24, 2026 · Updated last week
- Utilities for writing tests that use Apache Spark. · ☆24 · Dec 29, 2018 · Updated 7 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark · ☆61 · Sep 4, 2023 · Updated 2 years ago
- Delta Lake helper methods. No Spark dependency. · ☆22 · Jan 19, 2026 · Updated last month
- ScalaTest + ScalaCheck provides integration support between ScalaTest and ScalaCheck. · ☆58 · Dec 15, 2025 · Updated 2 months ago
- Scala wrapper for SnakeYAML · ☆101 · Sep 13, 2022 · Updated 3 years ago
- R code to work with the NIH RePORTER API · ☆11 · Feb 11, 2026 · Updated 3 weeks ago
- An extension to the amazing Spark framework for better functional programming. · ☆28 · May 19, 2016 · Updated 9 years ago
- Generic Framework over Scala 3 enumerations · ☆30 · Jan 30, 2024 · Updated 2 years ago
- Serialization toolbox for Akka messages, events and persistent state that helps achieve compile-time guarantee on serializability. No mor… · ☆30 · Updated this week
- A set of tools that make working with the Scala ecosystem even better. · ☆12 · Updated this week