life4 / textdistanceLinks
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
☆3,494Updated 5 months ago
Alternatives and similar repositories for textdistance
Users that are interested in textdistance are comparing it to the libraries listed below
Sorting:
- Multilingual text (NLP) processing toolkit☆2,351Updated last year
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,154Updated last week
- Extract Keywords from sentence or Replace keywords in sentences.☆5,681Updated 5 months ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,876Updated 3 months ago
- NLP, before and after spaCy☆2,230Updated 2 years ago
- Port of Google's language-detection library to Python.☆1,848Updated 7 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,198Updated last week
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,623Updated this week
- Beautiful visualizations of how language differs among document types.☆2,314Updated 5 months ago
- Stand-alone language identification system☆2,429Updated 5 years ago
- A fast, efficient universal vector embedding utility package.☆1,653Updated 2 years ago
- A natural language modeling framework based on PyTorch☆6,319Updated 2 years ago
- 👩🏫 Advanced NLP with spaCy: A free online course☆2,384Updated 8 months ago
- A system for quickly generating training data with weak supervision☆5,923Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,276Updated 4 years ago
- Text preprocessing, representation and visualization from zero to hero.☆2,910Updated 2 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,783Updated last year
- Module for automatic summarization of text documents and HTML pages.☆3,630Updated last month
- Pampy: The Pattern Matching for Python you always dreamed of.☆3,525Updated 8 months ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆1,033Updated 3 months ago
- Computing with Python functions.☆4,235Updated 2 weeks ago
- Library to scrape and clean web pages to create massive datasets.☆2,208Updated 4 years ago
- A Python toolbox for gaining geometric insights into high-dimensional data☆1,869Updated 3 months ago
- Concurrent data pipelines in Python >>>☆1,587Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,389Updated 4 years ago
- python parser for human readable dates☆2,728Updated last month
- extract text from any document. no muss. no fuss.☆4,321Updated 10 months ago
- Fixes mojibake and other glitches in Unicode text, after the fact.☆3,969Updated 11 months ago
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,440Updated this week
- HiPlot makes understanding high dimensional data easy☆2,799Updated last year