Python package for deduplication/entity resolution using active learning
☆83Aug 24, 2024Updated last year
Alternatives and similar repositories for deduplipy
Users that are interested in deduplipy are comparing it to the libraries listed below
Sorting:
- Analyzing the tree of imports of running Python code.☆12Feb 17, 2023Updated 3 years ago
- Fast fuzzy text search☆12May 16, 2023Updated 2 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- Keep your configuration files in sync☆16Jan 6, 2026Updated 2 months ago
- Just some FastHTML demos for safekeeps☆13Dec 10, 2024Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- Integration with (approximate) nearest neighbors libraries for scikit-learn + clustering based on with kNN-graphs.☆23Mar 3, 2026Updated last week
- Efficiently search and mine for specific (targeted) classes/slices in your dataset to improve model performance and personalize your mode…☆20Nov 17, 2023Updated 2 years ago
- Applying Snorkel to SuperGLUE☆26Dec 16, 2019Updated 6 years ago
- motivational website to do something special this month☆21Jan 11, 2024Updated 2 years ago
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development)☆25Mar 2, 2026Updated last week
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆66Updated this week
- It's a cooler way to store simple linear models.☆27Jul 15, 2024Updated last year
- Makes it easy to use altair from FastHTML☆28Oct 9, 2024Updated last year
- A ninja python package that unifies the Google Earth Engine ecosystem.☆66Updated this week
- Doubt your data, find bad labels.☆517Jul 15, 2024Updated last year
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning ex…☆53Dec 10, 2023Updated 2 years ago
- Super Simple Similarities Service☆155Apr 11, 2025Updated 10 months ago
- A lightweight implementation of shapes drawn across a geo-temporal plane.☆12Jan 27, 2026Updated last month
- Safitty is a wrapper on JSON/YAML configs for Python☆30Mar 19, 2020Updated 5 years ago
- edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries.…☆10Nov 14, 2021Updated 4 years ago
- Missing data amputation and exploration functions for Python☆72Dec 17, 2022Updated 3 years ago
- Record matching and entity resolution at scale in Spark☆36Oct 31, 2023Updated 2 years ago
- Stackable cache classes for sharing, encryption, statistics and more on top of cachetools, redis and memcached☆37Dec 14, 2025Updated 2 months ago
- A neural network hyper parameter tuner☆30Jan 2, 2024Updated 2 years ago
- Build dashboards in Jupyter Notebook with numeric and chart boxes☆216Jul 27, 2022Updated 3 years ago
- Extra blocks for scikit-learn pipelines.☆1,382Mar 1, 2026Updated last week
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,046Feb 21, 2024Updated 2 years ago
- JS snippet to send codeblock contents as a query string☆51Jun 11, 2025Updated 8 months ago
- captures logs and makes cron more fun☆81Jan 31, 2026Updated last month
- ☆14Dec 5, 2025Updated 3 months ago
- This is a repository for georeferencing of pushbroom hyperspectral imagery and includes ray-intersection, orthorectification and a coregi…☆11Oct 23, 2024Updated last year
- Simple python script that converts all Excel files (xls, xlsx, xlsm, csv) in a directory into xlsb files.☆10Mar 13, 2023Updated 2 years ago
- 𝗬𝗢𝗨𝗧𝗨𝗕𝗘 𝗜𝗣 𝗕𝗔𝗡 𝗜𝗦𝗦𝗨𝗘 𝗦𝗢𝗟𝗩𝗘𝗗. 𝗠𝗨𝗦𝗜𝗖 𝗕𝗢𝗧 𝗡𝗢 🌱𝗟𝗔𝗚 𝗙𝗔𝗦𝗧 𝗦𝗣𝗘𝗘𝗗 (V2)🏵️𝗕𝗢𝗧 ʏᴛ-ᴅʟᴘ ᴇʀʀᴏʀ …☆10Feb 9, 2026Updated last month
- SciCount is tool focused on counting and classifying of objects in image-like data and scientific images, with training and example datas…☆11Oct 24, 2023Updated 2 years ago
- ☆12Sep 21, 2023Updated 2 years ago
- ☆11Jul 3, 2020Updated 5 years ago