obulkin / string-distLinks
A Python library for calculating string distances using C extensions (with a pure Python fallback)
☆17Updated 5 years ago
Alternatives and similar repositories for string-dist
Users that are interested in string-dist are comparing it to the libraries listed below
Sorting:
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 5 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)☆116Updated last year
- NLP pipeline software using common workflow language☆35Updated 6 years ago
- Time everything in IPython☆126Updated 2 years ago
- A Python biomedical relation extraction package that uses a supervised approach (i.e. needs training data).☆158Updated 2 years ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- Calculate readability scores☆43Updated 6 years ago
- Library for unit extraction - fork of quantulum for python3☆145Updated last year
- Hive Plots in using Python & matplotlib!☆71Updated 7 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31Updated 4 years ago
- NLM .nxml to text format conversion☆24Updated 10 years ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…☆42Updated 3 years ago
- Public repository for versioning machine learning data☆42Updated 4 years ago
- Python utilities for Cytoscape and Cytoscape.js☆176Updated 3 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Venn diagrams with word clouds☆51Updated last year
- visJS2jupyter is a tool to bring the interactivity of networks created with vis.js into jupyter notebook cells☆78Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 8 months ago
- Classes for ClinicalTrials.gov related projects☆48Updated 6 years ago
- A convolutional neural network model for relation extraction.☆12Updated 2 years ago
- A Cython implementation of the affine gap string distance☆57Updated 3 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Updated 4 months ago
- Language detection extension for spaCy 2.0+☆114Updated 6 years ago
- Programmatically replace input values in a notebook before running it☆118Updated last year
- Deprecated: use the official mirror: https://github.com/rpy2/rpy2☆15Updated 6 years ago
- Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can conta…☆24Updated 6 years ago
- Commenting and annotation for JupyterLab☆103Updated 4 years ago
- cTAKES Python API interface for the Default Clinical Pipeline☆15Updated 7 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago