A library implementing different string similarity and distance measures using Python.
☆1,020Nov 12, 2022Updated 3 years ago
Alternatives and similar repositories for python-string-similarity
Users that are interested in python-string-similarity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,525Apr 18, 2025Updated 11 months ago
- Python Keyphrase Extraction module☆1,590Jul 12, 2023Updated 2 years ago
- 综合了同义词词林扩展版与知网(Hownet)的词语相似度计算方法,词汇覆盖更多、结果更准确。☆744Feb 16, 2022Updated 4 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,201Mar 10, 2026Updated 2 weeks ago
- State-of-the-Art Text Embeddings☆18,427Mar 12, 2026Updated last week
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Aug 2, 2024Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,352Oct 27, 2025Updated 4 months ago
- AutoPhrase: Automated Phrase Mining from Massive Text Corpora☆1,202Jan 27, 2022Updated 4 years ago
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics☆3,789Mar 11, 2026Updated last week
- Fuzzy String Matching in Python☆9,265Feb 24, 2023Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year
- ccks baidu entity link 实体链接 第一名☆842Dec 19, 2023Updated 2 years ago
- 中文近义词:聊天机器人,智能问答工具包☆5,104Feb 1, 2026Updated last month
- Open source annotation tool for machine learning practitioners.☆10,583Updated this week
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆865Feb 13, 2026Updated last month
- baike schema crawler for baidu baike , hudongbaike. 面向百度百科与互动百科的概念分类体系抓取脚本☆38Apr 25, 2018Updated 7 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,824Jan 23, 2024Updated 2 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,448Jul 29, 2025Updated 7 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,740Updated this week
- Fuzzy string matching, grouping, and evaluation.☆792Jul 10, 2025Updated 8 months ago
- Extract Keywords from sentence or Replace keywords in sentences.☆5,707Apr 13, 2025Updated 11 months ago
- A natural language modeling framework based on PyTorch☆6,306Oct 17, 2022Updated 3 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,974Jul 28, 2024Updated last year
- Text preprocessing, representation and visualization from zero to hero.☆2,909Aug 29, 2023Updated 2 years ago
- Various Algorithms for Short Text Mining☆472Mar 9, 2026Updated 2 weeks ago
- SiameseSentenceSimilarity,个人实现的基于Siamese bilstm模型的相似句子判定模型,提供训练数据集和测试数据集.☆271Dec 5, 2019Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,697May 8, 2023Updated 2 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,238Feb 6, 2026Updated last month
- Four word embedding models implemented in Python. Supporting arbitrary context features☆848Aug 22, 2019Updated 6 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,391Aug 26, 2021Updated 4 years ago
- Data augmentation for NLP☆4,652Jun 24, 2024Updated last year
- 100+ Chinese Word Vectors 上百种预训练中文词向量☆12,188Oct 30, 2023Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆771Mar 7, 2026Updated 2 weeks ago
- Abydos NLP/IR library for Python☆194Nov 10, 2022Updated 3 years ago
- 文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法☆2,603May 13, 2024Updated last year
- An Open-Source Package for Neural Relation Extraction (NRE)☆4,450Jan 10, 2024Updated 2 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,440Jul 15, 2025Updated 8 months ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,889Apr 13, 2023Updated 2 years ago