life4 / textdistance
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
☆3,464Updated last month
Alternatives and similar repositories for textdistance
Users that are interested in textdistance are comparing it to the libraries listed below
Sorting:
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,131Updated last month
- NLP, before and after spaCy☆2,225Updated last year
- Extract Keywords from sentence or Replace keywords in sentences.☆5,648Updated last month
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,174Updated 10 months ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,850Updated last month
- Pampy: The Pattern Matching for Python you always dreamed of.☆3,527Updated 4 months ago
- Fuzzy String Matching in Python☆9,259Updated 2 years ago
- A library implementing different string similarity and distance measures using Python.☆1,008Updated 2 years ago
- Concurrent data pipelines in Python >>>☆1,577Updated last year
- Fixes mojibake and other glitches in Unicode text, after the fact.☆3,909Updated 6 months ago
- Computing with Python functions.☆4,064Updated this week
- Pretty and useful exceptions in Python, automatically.☆4,634Updated 2 years ago
- A natural language modeling framework based on PyTorch☆6,327Updated 2 years ago
- Multilingual text (NLP) processing toolkit☆2,337Updated last year
- Stand-alone language identification system☆2,381Updated 5 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,274Updated 3 years ago
- 🦆 Contextually-keyed word vectors☆1,652Updated 3 weeks ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,167Updated this week
- Python library that makes it easy for data scientists to create charts.☆3,585Updated 7 months ago
- Rapid fuzzy string matching in Python using various string metrics☆3,084Updated this week
- Beautiful visualizations of how language differs among document types.☆2,302Updated 2 weeks ago
- extract text from any document. no muss. no fuss.☆4,126Updated 5 months ago
- A fast, efficient universal vector embedding utility package.☆1,647Updated last year
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,586Updated last month
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,381Updated 7 months ago
- Topic Modelling for Humans☆16,013Updated 3 months ago
- Declarative visualization library for Python☆9,768Updated last week
- A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural …☆2,940Updated 2 years ago
- Python dictionaries with advanced dot notation access☆2,728Updated last week
- Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.☆8,813Updated 11 months ago