Scalable String Similarity Joins in Python
☆39Jul 12, 2024Updated last year
Alternatives and similar repositories for py_stringsimjoin
Users that are interested in py_stringsimjoin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆144Feb 18, 2026Updated 2 months ago
- ☆192May 29, 2024Updated last year
- Hidden alignment conditional random field for classifying string pairs.☆36Sep 6, 2017Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Jan 12, 2026Updated 3 months ago
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Mar 23, 2015Updated 11 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- C++ Vantage Point Tree implementation with Python bindings☆16Jun 17, 2023Updated 2 years ago
- Asynchronous financial data management☆22Oct 3, 2017Updated 8 years ago
- albert-vi-as-service: A Fork of bert-as-service to deploy albert_vi☆11Apr 29, 2020Updated 6 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser☆11Aug 24, 2022Updated 3 years ago
- Approximate and vectorized versions of common mathematical functions☆13Mar 1, 2017Updated 9 years ago
- generic extraction recipes to get you started extracting schema.org entities for your software, data, and all things☆14Apr 6, 2019Updated 7 years ago
- ☆15Apr 6, 2018Updated 8 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆141Jun 21, 2012Updated 13 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- A script to generate tagged XML Citationstrings for citation parsing☆20Apr 17, 2020Updated 6 years ago
- Suite of tools for game developers building on MUD☆12Mar 13, 2024Updated 2 years ago
- Grapheme to phoneme toolkit using joint-modelling + CRFs in java☆14Jul 14, 2018Updated 7 years ago
- Levenshtein distance between two strings in julia☆14May 15, 2019Updated 6 years ago
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms.☆22Apr 11, 2026Updated 3 weeks ago
- A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).☆15Jun 4, 2021Updated 4 years ago
- Python tools to build, do inference with, and learn undirected graphical models.☆14Mar 25, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A fast implementation of GloVe, with optional retrofitting☆12Apr 16, 2019Updated 7 years ago
- Optimally-weighted herding is Bayesian Quadrature☆16Jul 8, 2016Updated 9 years ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov☆21Nov 13, 2025Updated 5 months ago
- ☆17Mar 12, 2021Updated 5 years ago
- linear-time dynamic programming dependency parser☆11Feb 2, 2019Updated 7 years ago
- Geopandas and Shapely☆10Jul 29, 2018Updated 7 years ago
- ☆13Nov 3, 2016Updated 9 years ago
- A list of free data matching and record linkage software.☆403Feb 21, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Name matching algorithm for company and people name in English☆15Dec 3, 2023Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,048Feb 21, 2024Updated 2 years ago
- A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm collisions; considering complex side constraints; and optimiz…☆11Jul 6, 2021Updated 4 years ago
- Enrich sf data with geographic features from OpenStreetMaps.☆19Dec 21, 2021Updated 4 years ago
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- Supplementary code for "Name2Vec: Personal Names Embeddings" presented at The Canadian Conference on AI 2019.☆18Jun 25, 2020Updated 5 years ago
- Command line utility for d3-pre pre-rendering pipeline☆13Jul 14, 2016Updated 9 years ago