life4 / textdistance
๐ Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
โ3,449Updated 5 months ago
Alternatives and similar repositories for textdistance:
Users that are interested in textdistance are comparing it to the libraries listed below
- Extract Keywords from sentence or Replace keywords in sentences.โ5,634Updated 8 months ago
- ๐ชผ a python library for doing approximate and phonetic matching of strings.โ2,103Updated 2 months ago
- Pampy: The Pattern Matching for Python you always dreamed of.โ3,520Updated last month
- A natural language modeling framework based on PyTorchโ6,329Updated 2 years ago
- Multilingual text (NLP) processing toolkitโ2,325Updated last year
- NLP, before and after spaCyโ2,215Updated last year
- Beautiful visualizations of how language differs among document types.โ2,285Updated 5 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)โ14,090Updated this week
- ๐ฎ A refreshing functional take on deep learning, compatible with your favorite librariesโ2,833Updated last month
- Fuzzy String Matching in Pythonโ9,248Updated 2 years ago
- A system for quickly generating training data with weak supervisionโ5,837Updated 10 months ago
- A Python library that generates static type annotations by collecting runtime typesโ4,854Updated 7 months ago
- A fast, efficient universal vector embedding utility package.โ1,642Updated last year
- ๐ดย Call stack profiler for Python. Shows you why your code is slow!โ6,876Updated last week
- Fixes mojibake and other glitches in Unicode text, after the fact.โ3,866Updated 4 months ago
- Concurrent data pipelines in Python >>>โ1,571Updated last year
- Rapid fuzzy string matching in Python using various string metricsโ2,924Updated this week
- Learning embeddings for classification, retrieval and ranking.โ3,949Updated 2 years ago
- An open-source NLP research library, built on PyTorch.โ11,818Updated 2 years ago
- Library to scrape and clean web pages to create massive datasets.โ2,180Updated 4 years ago
- Camelot: PDF Table Extraction for Humansโ3,676Updated 2 years ago
- Computing with Python functions.โ3,995Updated this week
- A lightning fast Finite State machine and REgular expression manipulation library.โ1,834Updated 2 months ago
- A library implementing different string similarity and distance measures using Python.โ1,002Updated 2 years ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)โ3,272Updated this week
- Topic Modelling for Humansโ15,878Updated 2 weeks ago
- Port of Google's language-detection library to Python.โ1,758Updated last year
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSWโ2,648Updated 9 months ago
- NLP made easyโ2,554Updated last year
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languagesโ7,388Updated this week