Scalable String Similarity Joins in Python
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for py_stringsimjoin
Users that are interested in py_stringsimjoin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆144Feb 18, 2026Updated 4 months ago
- ☆192May 29, 2024Updated 2 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Sep 6, 2017Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Jan 12, 2026Updated 5 months ago
- Python package for performing Entity and Text Matching using Deep Learning.☆620Jun 18, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 3 years ago
- Asynchronous financial data management☆22Oct 3, 2017Updated 8 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- FPsolve: solver for polynomial equations over omega-continuous semirings☆11Aug 15, 2015Updated 10 years ago
- Build-to-Order BLAS☆12Apr 9, 2019Updated 7 years ago
- ☆14Feb 1, 2024Updated 2 years ago
- A Rete-based, CLIPS-clone, inference engine in Python.☆19Feb 4, 2013Updated 13 years ago
- ☆18Jun 26, 2026Updated last week
- ☆15Apr 6, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 9 years ago
- The Ethereum Canvas☆10Oct 19, 2017Updated 8 years ago
- This repository contains the code and data download links to reproduce the experiments of the PVLDB paper "Dual-Objective Fine-Tuning of …☆16Jun 7, 2021Updated 5 years ago
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms.☆23Jun 26, 2026Updated last week
- Levenshtein distance between two strings in julia☆15May 15, 2019Updated 7 years ago
- A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).☆15Jun 4, 2021Updated 5 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆30Dec 6, 2024Updated last year
- An index data structure for approximate string search.☆23May 6, 2019Updated 7 years ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov☆22Jun 15, 2026Updated 2 weeks ago
- A GitBook about creating a GitBook for teaching☆10Apr 21, 2020Updated 6 years ago
- A Deep Learning based project for colorizing and restoring old images (and video!)☆23Jul 21, 2020Updated 5 years ago
- Geopandas and Shapely☆10Jul 29, 2018Updated 7 years ago
- ☆13Nov 3, 2016Updated 9 years ago
- ☆24May 5, 2026Updated last month
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,055Feb 21, 2024Updated 2 years ago
- Enrich sf data with geographic features from OpenStreetMaps.☆19Dec 21, 2021Updated 4 years ago
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Command line utility for d3-pre pre-rendering pipeline☆13Jul 14, 2016Updated 9 years ago
- A ChatGPT plugin for Solana☆13Jun 1, 2023Updated 3 years ago
- A CoroutineExecutor for asyncio, similar to nurseries and task groups☆13Aug 20, 2022Updated 3 years ago
- An rope jumping application on Android and Apple Watch☆12Sep 7, 2018Updated 7 years ago
- Entitypedia is an Extended Named Entity Dictionary from Wikipedia.☆13Dec 7, 2022Updated 3 years ago
- A bunch of tools for automating parts of a Systematic Review of scientific literature☆14Sep 16, 2020Updated 5 years ago
- Functional interface for concurrent futures, including async coroutines.☆11Jun 25, 2026Updated last week