Spark functions to run popular phonetic and string matching algorithms
☆59Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for spark-stringmetric
Users that are interested in spark-stringmetric are comparing it to the libraries listed below.
- PySpark phonetic and string matching algorithms☆41Feb 19, 2024Updated 2 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)☆24Dec 30, 2019Updated 6 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 5 years ago
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Write property based tests easily on spark dataframes☆20Jan 19, 2024Updated 2 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- ☆12Nov 2, 2024Updated last year
- Quartz Extension and utilities for cron-style scheduling in Apache Pekko☆12Dec 25, 2025Updated 3 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 7 months ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Expressive types for Spark.☆896Apr 7, 2026Updated last week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Apr 2, 2026Updated 2 weeks ago
- Scala wrapper for SnakeYAML☆102Sep 13, 2022Updated 3 years ago
- Used to generate mock Avro data☆15Jun 23, 2018Updated 7 years ago
- This project is about numbers: exact (1, e, π, 𝛙, √2, etc.), fuzzy e.g., 1836.152673426(32), or lazy e.g., cos(2π), as quantities (with …☆16Mar 25, 2026Updated 3 weeks ago
- Repository of my talk for Bayes@Lund 2017☆10Oct 4, 2017Updated 8 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- Twitter auto account report bot using selenium with python☆13Apr 19, 2024Updated last year
- Poison pills and Kafka Streams demo☆10Jul 25, 2020Updated 5 years ago
- ⚡ Live demo environment for Django Templates fully rendered in the browser, with PyScript☆12Sep 21, 2022Updated 3 years ago
- A giter8 template for Spark SBT projects☆72Mar 20, 2021Updated 5 years ago
- Predicting survival on the Titanic☆16Dec 16, 2017Updated 8 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- A docker image with a pre-configured Hive Metastore and a Spark ThriftServer☆19Jan 20, 2020Updated 6 years ago
- A collection of Lambda related implementations, libraries, resources and useful stuff.☆15Aug 26, 2022Updated 3 years ago
- A low-dependency HTTP health check server for Scala☆13Apr 7, 2026Updated last week
- Social value orientation (SVO) notes for pro-social pro-self concepts☆12Apr 14, 2025Updated last year
- String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gr…☆492Jul 28, 2017Updated 8 years ago
- Query OSM planet stats with AWS Athena☆23May 13, 2019Updated 6 years ago
- A tool to validate data, built around Apache Spark.☆101Feb 19, 2026Updated last month
- Support for JDK9's Multi Release JAR Files (JEP 238)☆17Sep 5, 2024Updated last year
- ☆24Mar 30, 2026Updated 2 weeks ago
- Fast JSON parser/generator for Scala☆115Mar 22, 2026Updated 3 weeks ago