A library implementing different string similarity and distance measures using Python.
☆1,020Nov 12, 2022Updated 3 years ago
Alternatives and similar repositories for python-string-similarity
Users that are interested in python-string-similarity are comparing it to the libraries listed below
Sorting:
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,517Apr 18, 2025Updated 10 months ago
- Python Keyphrase Extraction module☆1,588Jul 12, 2023Updated 2 years ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Aug 2, 2024Updated last year
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,193Dec 15, 2025Updated 2 months ago
- State-of-the-Art Text Embeddings☆18,323Updated this week
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,354Oct 27, 2025Updated 4 months ago
- AutoPhrase: Automated Phrase Mining from Massive Text Corpora☆1,201Jan 27, 2022Updated 4 years ago
- 综合了同义词词林扩展版与知网(Hownet)的词语相似度计算方法,词汇覆盖更多、结果更准确。☆744Feb 16, 2022Updated 4 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,438Jul 29, 2025Updated 7 months ago
- An open-source NLP research library, built on PyTorch.☆11,889Nov 22, 2022Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- Open source annotation tool for machine learning practitioners.☆10,555Feb 17, 2026Updated 2 weeks ago
- Fuzzy String Matching in Python☆9,270Feb 24, 2023Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics☆3,740Jan 26, 2026Updated last month
- ccks baidu entity link 实体链接 第一名☆843Dec 19, 2023Updated 2 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,817Jan 23, 2024Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- Extract Keywords from sentence or Replace keywords in sentences.☆5,708Apr 13, 2025Updated 10 months ago
- 中文近义词:聊天机器人,智能问答工具包☆5,106Feb 1, 2026Updated last month
- baike schema crawler for baidu baike , hudongbaike. 面向百度百科与互动百科的概念分类体系抓取脚本☆38Apr 25, 2018Updated 7 years ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,729Updated this week
- A natural language modeling framework based on PyTorch☆6,305Oct 17, 2022Updated 3 years ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆850Aug 22, 2019Updated 6 years ago
- Various Algorithms for Short Text Mining☆472Feb 23, 2026Updated last week
- Text preprocessing, representation and visualization from zero to hero.☆2,915Aug 29, 2023Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,391Aug 26, 2021Updated 4 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- Data augmentation for NLP☆4,645Jun 24, 2024Updated last year
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,981Jul 28, 2024Updated last year
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆769Updated this week
- An Open-Source Package for Neural Relation Extraction (NRE)☆4,449Jan 10, 2024Updated 2 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,232Feb 6, 2026Updated 3 weeks ago
- Textpipe: clean and extract metadata from text☆302Jun 9, 2021Updated 4 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆857Feb 13, 2026Updated 2 weeks ago
- SpaCy 中文模型 | Models for SpaCy that support Chinese☆673Jan 4, 2025Updated last year
- all kinds of text classification models and more with deep learning☆7,951Sep 28, 2023Updated 2 years ago
- Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includ…☆2,388Sep 3, 2024Updated last year
- A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural …☆2,933Nov 7, 2022Updated 3 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,874Jan 20, 2026Updated last month