itspawanbhardwaj / spark-fuzzy-matching
Fuzzy matching function in Spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)
☆24 · Dec 30, 2019 · Updated 6 years ago
Alternatives and similar repositories for spark-fuzzy-matching
Users who are interested in spark-fuzzy-matching compare it to the libraries listed below.
- Spark functions to run popular phonetic and string matching algorithms ☆59 · Feb 22, 2022 · Updated 3 years ago
- Data engineering pipeline for the household COVID-19 Infection Survey (CIS) ☆10 · Jul 18, 2023 · Updated 2 years ago
- A language detection web service ☆53 · May 9, 2017 · Updated 8 years ago
- Record matching and entity resolution at scale in Spark ☆36 · Oct 31, 2023 · Updated 2 years ago
- PDF to JSON, JSON to PDF, etc. ☆12 · Apr 18, 2018 · Updated 7 years ago
- Workflow Automation with Microsoft Power Automate, 2nd Edition, Published by Packt ☆17 · Jan 18, 2023 · Updated 3 years ago
- An awesome list that curates the best Flet tools, tutorials, blogs and more. ☆10 · Jan 8, 2023 · Updated 3 years ago
- This web scraper is intended to extract data from The Home Depot website; it can be run locally or on the Apify platform, the latter is… ☆10 · Oct 13, 2022 · Updated 3 years ago
- phData Pulse application log aggregation and monitoring ☆13 · Apr 13, 2020 · Updated 5 years ago
- Package provides a Java implementation of latent Dirichlet allocation (LDA) for topic modelling ☆10 · May 18, 2017 · Updated 8 years ago
- Code Repository for Technical Program Manager's Handbook 2E, Published by Packt Publishing ☆16 · Sep 25, 2024 · Updated last year
- ☆14 · Nov 27, 2025 · Updated 2 months ago
- Friday Forecasting Talks materials ☆11 · May 24, 2024 · Updated last year
- This repo is a curated list of places I consider for weekends in Athens with my kid. ☆11 · Dec 19, 2021 · Updated 4 years ago
- ☆11 · May 5, 2023 · Updated 2 years ago
- sbt plugin for Scala modules. ☆14 · Feb 9, 2026 · Updated last week
- A scikit-learn-compatible module for Isolation-based anomaly detection using nearest-neighbor ensembles ☆11 · Aug 30, 2023 · Updated 2 years ago
- Bringing up Docker Compose environments for system, integration and performance testing, with support for ScalaTest and Gatling ☆11 · Jul 29, 2021 · Updated 4 years ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python. ☆18 · Nov 28, 2025 · Updated 2 months ago
- ☆12 · Mar 12, 2024 · Updated last year
- Meet Rustacean GPT, an experimental project transforming OpenAI's GPT into a helpful, autonomous software engineer to support senior deve… ☆14 · May 10, 2023 · Updated 2 years ago
- A nicer UI for AWS Glue Data Catalog ☆10 · Jun 27, 2022 · Updated 3 years ago
- Subset Met Office MOGREPS-UK and UKV on AWS EC2 ☆12 · Oct 22, 2021 · Updated 4 years ago
- Lightweight job scheduling on Redis, in Python ☆21 · Nov 14, 2016 · Updated 9 years ago
- ☆10 · Feb 2, 2023 · Updated 3 years ago
- Hadoop InputFormat for http://druid.io/ ☆10 · Oct 26, 2016 · Updated 9 years ago
- ☆14 · Jul 8, 2025 · Updated 7 months ago
- An R package to simulate fMRI data, including activated data, noise data and resting-state data ☆11 · Sep 12, 2019 · Updated 6 years ago
- A batch (multiple concurrent sequence pairs) implementation of Dynamic Time Warping (DTW) in Theano ☆10 · Sep 13, 2015 · Updated 10 years ago
- Plutus for the masses ☆11 · Jan 20, 2023 · Updated 3 years ago
- A random name generator, written in Clojure ☆10 · Jul 17, 2024 · Updated last year
- Interior Point Conic Optimization Solver ☆10 · Updated this week
- Python wrapper for a C++ Double Metaphone ☆15 · Jan 12, 2026 · Updated last month
- JupyterLab Notebook for Mesosphere DC/OS ☆11 · Aug 6, 2019 · Updated 6 years ago
- Ted is a line-oriented text editor and formatter ☆12 · Jun 29, 2020 · Updated 5 years ago
- Shapeless generic instances for Scrooge types ☆14 · Feb 16, 2018 · Updated 8 years ago
- ☆13 · May 28, 2025 · Updated 8 months ago
- R package for weighted model metrics ☆11 · Apr 12, 2025 · Updated 10 months ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475 ☆17 · Feb 7, 2019 · Updated 7 years ago