life4 / textdistance
📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
☆3,395 Updated 2 months ago
Related projects
Alternatives and complementary repositories for textdistance
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,068 Updated 3 weeks ago
- extract text from any document. no muss. no fuss. ☆3,910 Updated this week
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries ☆2,820 Updated last month
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ☆2,150 Updated 4 months ago
- A natural language modeling framework based on PyTorch ☆6,338 Updated 2 years ago
- NLP, before and after spaCy ☆2,217 Updated last year
- Extract Keywords from sentence or Replace keywords in sentences. ☆5,597 Updated 4 months ago
- Learning embeddings for classification, retrieval and ranking. ☆3,947 Updated last year
- A fast, efficient universal vector embedding utility package. ☆1,627 Updated last year
- Multilingual text (NLP) processing toolkit ☆2,316 Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆1,263 Updated 3 years ago
- Concurrent data pipelines in Python >>> ☆1,549 Updated last year
- A system for quickly generating training data with weak supervision ☆5,812 Updated 6 months ago
- An open source python library for automated feature engineering ☆7,272 Updated this week
- 📚 Parameterize, execute, and analyze notebooks ☆5,977 Updated last month
- Python library that makes it easy for data scientists to create charts. ☆3,535 Updated last month
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,299 Updated last month
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner ☆2,541 Updated 8 months ago
- Pampy: The Pattern Matching for Python you always dreamed of. ☆3,516 Updated 2 years ago
- Camelot: PDF Table Extraction for Humans ☆3,666 Updated last year
- A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural … ☆2,942 Updated 2 years ago
- Pretty and useful exceptions in Python, automatically. ☆4,598 Updated last year
- Python library for interactive topic model visualization. Port of the R LDAvis package. ☆1,806 Updated 4 months ago
- A static type analyzer for Python code ☆4,775 Updated last week
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ☆1,510 Updated 7 months ago
- Port of Google's language-detection library to Python. ☆1,729 Updated 9 months ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...) ☆3,215 Updated 3 weeks ago
- Rapid fuzzy string matching in Python using various string metrics ☆2,732 Updated this week
- Beautiful visualizations of how language differs among document types. ☆2,250 Updated last month
- 🚪✊ Knock Knock: Get notified when your training ends with only two additional lines of code ☆2,794 Updated last year