Scalable String Similarity Joins in Python
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for py_stringsimjoin
Users that are interested in py_stringsimjoin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆144Feb 18, 2026Updated 3 months ago
- ☆192May 29, 2024Updated last year
- Hidden alignment conditional random field for classifying string pairs.☆36Sep 6, 2017Updated 8 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆616Jun 18, 2024Updated last year
- Learned string similarity for entity names using optimal transport.☆35Nov 17, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Mar 23, 2015Updated 11 years ago
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- Implementation of Shake-Shake by chainer (Shake-Shake regularization of 3-branch residual networks: https://openreview.net/forum?id=HkO-P…☆10Aug 24, 2017Updated 8 years ago
- Asynchronous financial data management☆22Oct 3, 2017Updated 8 years ago
- MOVED to https://gitlab.com/crossref/reference_matching_evaluation_framework☆17Jul 1, 2019Updated 6 years ago
- Learning String Alignments for Entity Aliases☆37Mar 21, 2019Updated 7 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- FPsolve: solver for polynomial equations over omega-continuous semirings☆11Aug 15, 2015Updated 10 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser☆11Aug 24, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Approximate and vectorized versions of common mathematical functions☆13Mar 1, 2017Updated 9 years ago
- A Rete-based, CLIPS-clone, inference engine in Python.☆19Feb 4, 2013Updated 13 years ago
- ☆16Jan 7, 2021Updated 5 years ago
- ☆17May 7, 2026Updated 2 weeks ago
- This is the implementation of word aligner using Hidden Markov Model☆10Jun 24, 2019Updated 6 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 9 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆141Jun 21, 2012Updated 13 years ago
- Active Imitation Learing with Noisy Guidance☆10May 29, 2020Updated 5 years ago
- Suite of tools for game developers building on MUD☆12Mar 13, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Java tools to do natural language processing like NER and intent classification on short sentences☆16Aug 12, 2018Updated 7 years ago
- Levenshtein distance between two strings in julia☆14May 15, 2019Updated 7 years ago
- A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).☆15Jun 4, 2021Updated 4 years ago
- Windows SDK for the Microsoft Entity Linking Intelligence Service, part of Cognitive Services☆21Jan 18, 2017Updated 9 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆30Dec 6, 2024Updated last year
- An index data structure for approximate string search.☆23May 6, 2019Updated 7 years ago
- A fast implementation of GloVe, with optional retrofitting☆12Apr 16, 2019Updated 7 years ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov☆22Nov 13, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- python3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the superse…☆23Sep 15, 2023Updated 2 years ago
- A pipeline for automated mapping of aggregate racial/ancestral groups - based on a 1976 map of Chicago☆21Oct 17, 2017Updated 8 years ago
- Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing☆22Sep 24, 2024Updated last year
- Visualizations of character embeddings from derived character vectors.☆13Apr 4, 2017Updated 9 years ago
- Geopandas and Shapely☆10Jul 29, 2018Updated 7 years ago
- ☆24May 5, 2026Updated 2 weeks ago
- Name matching algorithm for company and people name in English☆15Dec 3, 2023Updated 2 years ago