"1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook
☆84Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for deduplication-slides
Users that are interested in deduplication-slides are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,049Feb 21, 2024Updated 2 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,451Jul 29, 2025Updated 8 months ago
- ☆12Apr 27, 2018Updated 7 years ago
- An online jukebox with all the songs from Deezer and YouTube. Built with Django and Angular.☆22Apr 11, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A list of free data matching and record linkage software.☆403Feb 21, 2024Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆127Feb 22, 2024Updated 2 years ago
- Example frontera project☆12Aug 10, 2017Updated 8 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆286Aug 9, 2022Updated 3 years ago
- Site do PugPE☆16Jul 19, 2023Updated 2 years ago
- Site da Python Brasil 2020☆11Nov 6, 2020Updated 5 years ago
- Easy Django integration with Elasticsearch through ZomboDB Postgres Extension☆148Dec 28, 2022Updated 3 years ago
- Analise dos casos de violência contra a mulher no estado de PE☆14Aug 23, 2022Updated 3 years ago
- Temporary repository for implementing tensor factorization algorithms on Apache Spark☆13Nov 27, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python implementations of record linkage blocking techniques.☆21Oct 2, 2023Updated 2 years ago
- Connect to Microsoft 365 using the MS Graph API - macro functions for listing content, upload, download, and more.☆16Jan 31, 2026Updated 2 months ago
- generic extraction recipes to get you started extracting schema.org entities for your software, data, and all things☆14Apr 6, 2019Updated 7 years ago
- a python library for parsing unstructured western names into name components.☆618May 15, 2025Updated 10 months ago
- This project contains simple methods to measure sample relatedness and identify potential swaps and contamination☆10Jan 8, 2016Updated 10 years ago
- Scalable String Similarity Joins in Python☆39Jul 12, 2024Updated last year
- ☆21Jul 6, 2023Updated 2 years ago
- ☆14Sep 22, 2022Updated 3 years ago
- Extremely accurate algorithm used to group DNA sequences from microbial communities into operational taxonomic units (proxy for species) …☆14Nov 20, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Levenshtein distance between two strings in julia☆14May 15, 2019Updated 6 years ago
- Query OSM planet stats with AWS Athena☆23May 13, 2019Updated 6 years ago
- Copula fitting in Python.☆13Dec 4, 2023Updated 2 years ago
- Link Wikidata items to large catalogs☆96Mar 2, 2026Updated last month
- Super Fast String Matching in Python☆370Mar 14, 2025Updated last year
- Digital Database of Microbial Phenotypes. Like an online Bergey's Manual.☆13Mar 1, 2012Updated 14 years ago
- AlgoTree☆16Jan 30, 2026Updated 2 months ago
- ☆36Aug 13, 2017Updated 8 years ago
- Python for multiobjective cash management☆12Sep 21, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Materials for the "Data Wrangling" CADi workshop @ "Tecnológico de Monterrey"☆10Dec 21, 2021Updated 4 years ago
- PowerShell auto-completion providers for some cmdlets and native commands☆12Jan 30, 2025Updated last year
- Django App to integrate API Star's routes and views into Django's ecossystem.☆23Sep 18, 2018Updated 7 years ago
- Portal do Grupo de Usuários Python de Pernambuco☆16Mar 10, 2012Updated 14 years ago
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- A Python port of the Perl address parser available at http://search.cpan.org/~timb/Geo-StreetAddress-US-1.03/US.pm☆26Jan 31, 2019Updated 7 years ago
- MAL(make a lisp) interpreter written in Go.☆10Feb 18, 2023Updated 3 years ago