nap / jaro-winkler-distanceLinks
Finds the Jaro Winkler Distance indicating a distance or similarity score between two strings.
☆27Updated 5 months ago
Alternatives and similar repositories for jaro-winkler-distance
Users that are interested in jaro-winkler-distance are comparing it to the libraries listed below
Sorting:
- Original, standard and customisable versions of the Jaro-Winkler functions.☆31Updated 2 years ago
- Load embeddings and featurize your sentences.☆30Updated 8 months ago
- Language detection using Spacy and Fasttext☆57Updated last year
- ☆70Updated 2 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 6 years ago
- Code examples for Google Natural Language API.☆13Updated 5 years ago
- ☆30Updated 3 years ago
- A web application tagging and retrieval of arguments in text☆29Updated 2 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Levenshtein and Hamming distance computation☆116Updated 5 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Stand-alone WordNet API☆49Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Accurately find/replace/remove emojis in text strings☆163Updated last year
- A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.☆124Updated last year
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 3 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- SImple SenTence EmbeddeR☆74Updated 2 years ago
- Graph extraction and NLP analysis for Baleen Corpora☆18Updated 8 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 4 years ago
- A Python implementation of the uncertainty classifier, based on the work of Veronika Vincze.☆17Updated 11 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆86Updated last month
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Prebuilt .whl files for MacOS + Linux of the Facebook FAISS library☆56Updated 3 years ago
- Text readability metrics in Python.☆11Updated 11 years ago
- Using questions to summarize large amounts of textual data.☆25Updated 4 years ago