A simple fuzzy matching set for python strings
☆232Aug 15, 2024Updated last year
Alternatives and similar repositories for fuzzyset
Users that are interested in fuzzyset are comparing it to the libraries listed below
Sorting:
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆286Aug 9, 2022Updated 3 years ago
- Fuzzy String Matching in Python☆9,270Feb 24, 2023Updated 3 years ago
- A rangeset utility for python☆14Dec 12, 2023Updated 2 years ago
- Stripe payment integration for Salesman.☆12Feb 23, 2023Updated 3 years ago
- Lazy python recipes.☆10Apr 17, 2021Updated 4 years ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated last month
- limit spend on AWS based on a tag, stop compute instances at threshold☆13Apr 13, 2020Updated 5 years ago
- ☆13Jun 22, 2017Updated 8 years ago
- A set of Python scripts for helping manage projects on the commandline☆17Feb 1, 2012Updated 14 years ago
- ☆14Mar 9, 2017Updated 8 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,193Dec 15, 2025Updated 2 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,046Feb 21, 2024Updated 2 years ago
- Links parts of input text to Wikipedia articles☆16Sep 9, 2012Updated 13 years ago
- a CLI tool to interact with Docker registries☆13Oct 5, 2017Updated 8 years ago
- An open-source 2D snake game made with Pyglet and Python 3.8.☆12Mar 12, 2025Updated 11 months ago
- Uses TF-IDF and inverted search to cluster search results☆22Mar 10, 2011Updated 14 years ago
- Super Fast String Matching in Python☆371Mar 14, 2025Updated 11 months ago
- Fuzzy joins for python pandas - easily join different datasets☆59Aug 11, 2020Updated 5 years ago
- Pre-trained models for tokenization, sentence segmentation and so on☆15Aug 22, 2017Updated 8 years ago
- This is an attempt to familiarize myself with PyTorch. In this example, the target to generate a sequence of continuous data (sine waves …☆15Dec 26, 2017Updated 8 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Mar 19, 2024Updated last year
- This is a work in progress Pytorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition☆12Apr 9, 2019Updated 6 years ago
- Example of configuring multiplage apps via a custom config file☆18Nov 14, 2023Updated 2 years ago
- Probabilistic Programming in Python. Uses Theano as a backend and includes the NUTS sampler.☆12Apr 19, 2017Updated 8 years ago
- The useful and used parts of NN-Dropout☆25Jun 4, 2015Updated 10 years ago
- workflow support for reproducible deduplication and merging☆16Jun 29, 2023Updated 2 years ago
- A collection of Pandas helper functions.☆14Apr 4, 2023Updated 2 years ago
- Publicly available data for Paperscape☆45Mar 19, 2018Updated 7 years ago
- Django app: forcing the model to call full_clean() method on save.☆17Jun 27, 2021Updated 4 years ago
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask …☆46Sep 18, 2016Updated 9 years ago
- ☆18Apr 25, 2018Updated 7 years ago
- PyMix - The Python mixture package☆16Nov 9, 2015Updated 10 years ago
- convert sqlite database to duckdb database☆27May 23, 2024Updated last year
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆41Aug 30, 2010Updated 15 years ago
- Data visualization and analysis library based on the pydata stack☆19May 31, 2017Updated 8 years ago
- Scripts to launch IPython parallel on a cluster and run an IPython Notebook to run the analysis.☆27Feb 24, 2014Updated 12 years ago
- A command-line syndication feed monitor☆49Feb 7, 2026Updated 3 weeks ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,440Jul 29, 2025Updated 7 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,278Aug 11, 2021Updated 4 years ago