Spark functions to run popular phonetic and string matching algorithms
☆60Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for spark-stringmetric
Users that are interested in spark-stringmetric are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PySpark phonetic and string matching algorithms☆41Feb 19, 2024Updated 2 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)☆24Dec 30, 2019Updated 6 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Write property based tests easily on spark dataframes☆21Jan 19, 2024Updated 2 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- ☆12Nov 2, 2024Updated last year
- Quartz Extension and utilities for cron-style scheduling in Apache Pekko☆12Dec 25, 2025Updated 5 months ago
- A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dic…☆83Apr 25, 2022Updated 4 years ago
- Essential Spark extensions and helper methods ✨😲☆767Sep 14, 2025Updated 9 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆63Sep 4, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15May 19, 2019Updated 7 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 4 months ago
- Spark style guide☆270Sep 30, 2024Updated last year
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆191Oct 15, 2025Updated 8 months ago
- sbt plugin to allow dependency resolution and artifact publishing for gitlab☆10Mar 1, 2026Updated 3 months ago
- Expressive types for Spark.☆898Updated this week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆456Apr 2, 2026Updated 2 months ago
- Scala wrapper for SnakeYAML☆103Sep 13, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- type-class based data cleansing library for Apache Spark SQL☆79Jun 23, 2019Updated 6 years ago
- ShikiPlayer — это пользовательский скрипт, который встраивает видеоплееры на сайт Shikimori☆32May 27, 2026Updated 3 weeks ago
- A module for the decline command line parser to enable bash and zsh autocomplete☆14Aug 7, 2023Updated 2 years ago
- This project is about numbers: exact (1, e, π, 𝛙, √2, etc.), fuzzy e.g., 1836.152673426(32), or lazy e.g., cos(2π), as quantities (with …☆16Jun 3, 2026Updated last week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- ⚡ Live demo environment for Django Templates fully rendered in the browser, with PyScript☆12Sep 21, 2022Updated 3 years ago
- A giter8 template for Spark SBT projects☆72Mar 20, 2021Updated 5 years ago
- Predicting survival on the Titanic☆16Dec 16, 2017Updated 8 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Creating Debian Packages from CRAN Sources☆12Jul 1, 2020Updated 5 years ago
- A collection of Lambda related implementations, libraries, resources an useful stuff.☆15Aug 26, 2022Updated 3 years ago
- A low-dependency HTTP health check server for Scala☆14Updated this week
- ☆12May 16, 2017Updated 9 years ago
- Social value orientation (SVO) notes for pro-social pro-self concepts☆13Apr 14, 2025Updated last year
- String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gr…☆492Jul 28, 2017Updated 8 years ago
- Query OSM planet stats with AWS Athena☆23May 13, 2019Updated 7 years ago