Fast, flexible name matching for large datasets
β71Aug 29, 2025Updated 6 months ago
Alternatives and similar repositories for nama
Users that are interested in nama are comparing it to the libraries listed below
Sorting:
- π Fuzzy Name Matching with Machine Learningβ267Jun 17, 2024Updated last year
- β12May 20, 2023Updated 2 years ago
- Training for assessing replicabilityβ14Jan 8, 2026Updated last month
- A Python library for defining rule-based overrides on messy dataβ18Nov 24, 2025Updated 3 months ago
- π Finds fuzzy matches between datasetsβ16Jan 26, 2026Updated last month
- β10Dec 13, 2014Updated 11 years ago
- this is the code that goes along with the AJC story at https://www.ajc.com/news/state--regional-govt--politics/precinct-closures-harm-votβ¦β13Dec 13, 2019Updated 6 years ago
- A database on VC-backed startups from Ewens and Malenko (2025)β13Feb 15, 2025Updated last year
- Automatic Text Summarization with Machine Learningβ15Jul 30, 2017Updated 8 years ago
- Name matching algorithm for company and people name in Englishβ15Dec 3, 2023Updated 2 years ago
- β21Dec 11, 2024Updated last year
- Person name matching toolsβ13Aug 31, 2017Updated 8 years ago
- Python language parser for a tabular format for structured metadata. http://metatab.orgβ18Sep 28, 2023Updated 2 years ago
- DJIA index prices of 10 years and NYtimes news articles headline has been used to predict the DJIA index pricesβ18Feb 21, 2018Updated 8 years ago
- Two way models in pythonβ36Feb 6, 2024Updated 2 years ago
- yet another foia automation serviceβ44Jul 6, 2022Updated 3 years ago
- Codes required to implement various approaches to historical record linkingβ20Jul 8, 2020Updated 5 years ago
- A work-in-progress guide showing how and why you should learn command-line tools (xsv, csvkit) to work with dataβ19Mar 16, 2019Updated 6 years ago
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings"β19Jun 5, 2019Updated 6 years ago
- An automatic paraphraser/summarizer/information extractor built using Python.β17Apr 1, 2016Updated 9 years ago
- Python package for performing Entity and Text Matching using Deep Learning.β614Jun 18, 2024Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasetsβ118Nov 21, 2025Updated 3 months ago
- Code release for "A Time-Aware Transformer Based Model for Suicide Ideation Detection on Social Media", EMNLP 2020.β54Nov 16, 2020Updated 5 years ago
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.β52Jul 31, 2017Updated 8 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ1,046Feb 21, 2024Updated 2 years ago
- Stata command to perform randomization inference and permutation tests, allowing for arbitrary randomization procedures with (almost) anyβ¦β34Jun 4, 2025Updated 8 months ago
- Fast sparse regressions with advanced formula syntax. OLS, GLM, Poisson, Maxlike, and more. High-dimensional fixed effects.β67Jun 27, 2023Updated 2 years ago
- The EU structural funds datasets on regional and national level (in progress).β29May 31, 2023Updated 2 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4β286Aug 9, 2022Updated 3 years ago
- Learned string similarity for entity names using optimal transport.β35Nov 17, 2020Updated 5 years ago
- data wrangling simplicity, complete audit transparency, and at speedβ35Sep 30, 2025Updated 5 months ago
- For watching a set of URLs and notifying someone when something has changed.β32Jun 12, 2017Updated 8 years ago
- Myanmar consonant and vowel audio files that I recorded at University of Computer Studies Banmawβ11Mar 2, 2019Updated 7 years ago
- β10Feb 19, 2024Updated 2 years ago
- β12Apr 26, 2020Updated 5 years ago
- Google Ticker Stock SVI (TS-SVI)β16Dec 16, 2024Updated last year
- Easy formatted text extraction from images using Google Vision APIβ41Jun 9, 2021Updated 4 years ago
- Medical Relations and Entities Extractionβ37Jun 21, 2022Updated 3 years ago
- Extended stochastic block models with application to criminal networksβ12Jan 16, 2022Updated 4 years ago