Spark functions to run popular phonetic and string matching algorithms
☆59Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for spark-stringmetric
Users that are interested in spark-stringmetric are comparing it to the libraries listed below.
- PySpark phonetic and string matching algorithms☆41Feb 19, 2024Updated 2 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)☆24Dec 30, 2019Updated 6 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Write property based tests easily on spark dataframes☆20Jan 19, 2024Updated 2 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- ☆12Nov 2, 2024Updated last year
- A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dic…☆82Apr 25, 2022Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Sep 4, 2023Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 6 months ago
- ☆15May 19, 2019Updated 6 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- Spark style guide☆272Sep 30, 2024Updated last year
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- sbt plugin to allow dependency resolution and artifact publishing for gitlab☆10Mar 1, 2026Updated 3 weeks ago
- Expressive types for Spark.☆896Mar 16, 2026Updated last week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Feb 8, 2026Updated last month
- Scala wrapper for SnakeYAML☆101Sep 13, 2022Updated 3 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- A module for the decline command line parser to enable bash and zsh autocomplete☆14Aug 7, 2023Updated 2 years ago
- ShikiPlayer is a userscript that embeds video players into the Shikimori site☆25Mar 5, 2026Updated 3 weeks ago
- Used to generate mock Avro data☆15Jun 23, 2018Updated 7 years ago
- This project is about numbers: exact (1, e, π, 𝛙, √2, etc.), fuzzy e.g., 1836.152673426(32), or lazy e.g., cos(2π), as quantities (with …☆16Mar 18, 2026Updated last week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Scala data validation library☆29Aug 14, 2016Updated 9 years ago
- Poison pills and Kafka Streams demo☆10Jul 25, 2020Updated 5 years ago
- Predicting survival on the Titanic☆16Dec 16, 2017Updated 8 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- Talks, Meetup and Workshops☆12Jun 4, 2024Updated last year
- ☆12May 16, 2017Updated 8 years ago
- Create STAC Collections/Items for some AWS OpenData☆16Jan 18, 2025Updated last year
- String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gr…☆491Jul 28, 2017Updated 8 years ago
- A tool to validate data, built around Apache Spark.☆101Feb 19, 2026Updated last month
- Support for JDK9's Multi Release JAR Files (JEP 238)☆17Sep 5, 2024Updated last year
- A small yet nice package to help you parse all types of URL and return the parsed url with group name.☆14Jun 5, 2020Updated 5 years ago
- ☆23Updated this week
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- Fast JSON parser/generator for Scala☆114Updated this week