Spark functions to run popular phonetic and string matching algorithms
☆59Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for spark-stringmetric
Users that are interested in spark-stringmetric are comparing it to the libraries listed below.
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)☆24Dec 30, 2019Updated 6 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 5 years ago
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Write property based tests easily on spark dataframes☆21Jan 19, 2024Updated 2 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- ☆12Nov 2, 2024Updated last year
- Quartz Extension and utilities for cron-style scheduling in Apache Pekko☆12Dec 25, 2025Updated 4 months ago
- A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dic…☆83Apr 25, 2022Updated 4 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 7 months ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 3 months ago
- Spark style guide☆271Sep 30, 2024Updated last year
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆191Oct 15, 2025Updated 6 months ago
- sbt plugin to allow dependency resolution and artifact publishing for gitlab☆10Mar 1, 2026Updated 2 months ago
- Expressive types for Spark.☆896Apr 27, 2026Updated last week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Apr 2, 2026Updated last month
- Scala wrapper for SnakeYAML☆103Sep 13, 2022Updated 3 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- A module for the decline command line parser to enable bash and zsh autocomplete☆14Aug 7, 2023Updated 2 years ago
- ShikiPlayer is a user script that embeds video players into the Shikimori site☆28Mar 5, 2026Updated 2 months ago
- Used to generate mock Avro data☆15Jun 23, 2018Updated 7 years ago
- This project is about numbers: exact (1, e, π, 𝛙, √2, etc.), fuzzy e.g., 1836.152673426(32), or lazy e.g., cos(2π), as quantities (with …☆16Apr 26, 2026Updated last week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Scala data validation library☆30Aug 14, 2016Updated 9 years ago
- Twitter auto account report bot using selenium with python☆13Apr 19, 2024Updated 2 years ago
- Poison pills and Kafka Streams demo☆10Jul 25, 2020Updated 5 years ago
- ⚡ Live demo environment for Django Templates fully rendered in the browser, with PyScript☆12Sep 21, 2022Updated 3 years ago
- An R package for visualising high-dimensional clustering algorithms☆16Apr 24, 2014Updated 12 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- 🏡 The home page for an opinionated intermediate/advanced Git book☆12Aug 22, 2019Updated 6 years ago
- Talks, Meetup and Workshops☆12Jun 4, 2024Updated last year
- Creating Debian Packages from CRAN Sources☆12Jul 1, 2020Updated 5 years ago
- A collection of Lambda related implementations, libraries, resources and useful stuff.☆15Aug 26, 2022Updated 3 years ago
- A low-dependency HTTP health check server for Scala☆13Apr 29, 2026Updated last week
- Social value orientation (SVO) notes for pro-social pro-self concepts☆13Apr 14, 2025Updated last year
- String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gr…☆492Jul 28, 2017Updated 8 years ago
- Query OSM planet stats with AWS Athena☆23May 13, 2019Updated 6 years ago