A library implementing different string similarity and distance measures using Python.
☆1,019Nov 12, 2022Updated 3 years ago
Alternatives and similar repositories for python-string-similarity
Users who are interested in python-string-similarity are comparing it to the libraries listed below.
- 📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional external libs usage.☆3,526Apr 18, 2025Updated last year
- Python Keyphrase Extraction module☆1,590Jul 12, 2023Updated 2 years ago
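- A word similarity method combining the extended Tongyici Cilin synonym thesaurus and HowNet, with broader vocabulary coverage and more accurate results.☆744Feb 16, 2022Updated 4 years ago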
- 🪼 A Python library for doing approximate and phonetic matching of strings.☆2,209Apr 7, 2026Updated 3 weeks ago
- State-of-the-Art Text Embeddings☆18,615Updated this week
- Facilitating the design, comparison and sharing of deep text matching models.☆3,848Aug 2, 2024Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,370Oct 27, 2025Updated 6 months ago
- AutoPhrase: Automated Phrase Mining from Massive Text Corpora☆1,201Jan 27, 2022Updated 4 years ago
- An open-source NLP research library, built on PyTorch.☆11,891Nov 22, 2022Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics☆3,871Apr 20, 2026Updated 2 weeks ago
- Fuzzy String Matching in Python☆9,257Feb 24, 2023Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,105May 9, 2024Updated last year
- CCKS Baidu entity linking, first place☆842Dec 19, 2023Updated 2 years ago
- Chinese synonyms: a toolkit for chatbots and intelligent question answering☆5,103Feb 1, 2026Updated 3 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆869Apr 20, 2026Updated last week
- Open source annotation tool for machine learning practitioners.☆10,639Apr 14, 2026Updated 2 weeks ago
- Baike schema crawler for Baidu Baike and Hudong Baike: a script that crawls the concept classification systems of both encyclopedias.☆38Apr 25, 2018Updated 8 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,836Jan 23, 2024Updated 2 years ago
- A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,455Jul 29, 2025Updated 9 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,783Apr 25, 2026Updated last week
- Fuzzy string matching, grouping, and evaluation.☆794Jul 10, 2025Updated 9 months ago
- Extract Keywords from sentence or Replace keywords in sentences.☆5,711Apr 13, 2025Updated last year
- A natural language modeling framework based on PyTorch☆6,301Oct 17, 2022Updated 3 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,970Jul 28, 2024Updated last year
- Text preprocessing, representation and visualization from zero to hero.☆2,910Aug 29, 2023Updated 2 years ago
- Various Algorithms for Short Text Mining☆471Updated this week
- SiameseSentenceSimilarity: a personal implementation of a similar-sentence detection model based on a Siamese BiLSTM, with training and test datasets provided.☆271Dec 5, 2019Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,697May 8, 2023Updated 2 years ago
- Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,254Feb 6, 2026Updated 2 months ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆847Aug 22, 2019Updated 6 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,391Aug 26, 2021Updated 4 years ago
- Data augmentation for NLP☆4,656Jun 24, 2024Updated last year
- 100+ pre-trained Chinese word vectors☆12,211Oct 30, 2023Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆779Apr 1, 2026Updated last month
- Abydos NLP/IR library for Python☆194Nov 10, 2022Updated 3 years ago
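- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods☆2,613May 13, 2024Updated last year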
- An Open-Source Package for Neural Relation Extraction (NRE)☆4,457Jan 10, 2024Updated 2 years ago
- Pre-trained Chinese ELECTRA models☆1,438Apr 19, 2026Updated 2 weeks ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,891Apr 13, 2023Updated 3 years ago