vaneseltine / nominally
A maximum-strength name parser for record linkage.
☆36Updated 5 months ago
Alternatives and similar repositories for nominally:
Users that are interested in nominally are comparing it to the libraries listed below
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- A browser user interface for manual labeling of record pairs.☆43Updated last year
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 5 years ago
- Just charts. Really.☆22Updated last year
- Inspect a URL and estimate if it contains a news story☆39Updated 2 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- ☆13Updated 5 years ago
- A Python library for defining rule-based overrides on messy data☆13Updated 2 months ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- this repo contains the draft, images, and code for the Medium blog post on altair themes.☆12Updated 6 years ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆22Updated 3 months ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 4 years ago
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated 5 months ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆16Updated this week
- A financial disclosure data extraction tool.☆13Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 3 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆12Updated last year
- (Archived) A Python library for record linkage and deduplication.☆19Updated 10 months ago
- Scalable String Similarity Joins in Python☆38Updated 6 months ago
- Slideshow template for Voilà based on RevealJS☆16Updated 3 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Write Datasette canned queries as plain SQL files☆13Updated 2 years ago
- Python implementation of the Data Package standard and various models and utils for working with data.☆15Updated 4 months ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- Extract networks of entities from journalistic reporting☆47Updated last year
- Fuzzy Categorical Distances☆14Updated 4 years ago