A library implementing different string similarity and distance measures using Python.
☆1,018 Nov 12, 2022 Updated 3 years ago
Alternatives and similar repositories for python-string-similarity
Users that are interested in python-string-similarity are comparing it to the libraries listed below.
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,529 Apr 18, 2025 Updated last year
- Python Keyphrase Extraction module ☆1,590 Jul 12, 2023 Updated 2 years ago
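- A word similarity method that combines the extended Tongyici Cilin synonym thesaurus with HowNet, for broader vocabulary coverage and more accurate results. ☆744 Feb 16, 2022 Updated 4 years ago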
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,212 Apr 7, 2026 Updated last month
- State-of-the-Art Embeddings, Retrieval, and Reranking ☆18,711 Updated this week
- Facilitating the design, comparison and sharing of deep text matching models. ☆3,850 Aug 2, 2024 Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,376 Oct 27, 2025 Updated 6 months ago
- AutoPhrase: Automated Phrase Mining from Massive Text Corpora ☆1,201 Jan 27, 2022 Updated 4 years ago
- An open-source NLP research library, built on PyTorch. ☆11,893 Nov 22, 2022 Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics ☆3,917 May 11, 2026 Updated last week
- Fuzzy String Matching in Python ☆9,257 Feb 24, 2023 Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,110 May 9, 2024 Updated 2 years ago
- CCKS Baidu entity linking: first-place solution ☆841 Dec 19, 2023 Updated 2 years ago
- Chinese synonyms: a toolkit for chatbots and intelligent question answering ☆5,103 Feb 1, 2026 Updated 3 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆870 Apr 20, 2026 Updated last month
- Open source annotation tool for machine learning practitioners. ☆10,648 Apr 14, 2026 Updated last month
- Baike schema crawler for Baidu Baike and Hudong Baike: a script that crawls the concept classification systems of both encyclopedias. ☆38 Apr 25, 2018 Updated 8 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP ☆12,834 Jan 23, 2024 Updated 2 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ☆4,463 Jul 29, 2025 Updated 9 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages ☆7,790 Updated this week
- Fuzzy string matching, grouping, and evaluation. ☆796 Jul 10, 2025 Updated 10 months ago
- Extract Keywords from sentence or Replace keywords in sentences. ☆5,710 Apr 13, 2025 Updated last year
- A natural language modeling framework based on PyTorch ☆6,299 Oct 17, 2022 Updated 3 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo… ☆22,968 Jul 28, 2024 Updated last year
- Text preprocessing, representation and visualization from zero to hero. ☆2,911 Aug 29, 2023 Updated 2 years ago
- Various Algorithms for Short Text Mining ☆471 Updated this week
- SiameseSentenceSimilarity: a personal implementation of a similar-sentence detection model based on a Siamese BiLSTM, with training and test datasets provided. ☆271 Dec 5, 2019 Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing ☆1,700 May 8, 2023 Updated 3 years ago
- Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard ☆4,254 Feb 6, 2026 Updated 3 months ago
- Four word embedding models implemented in Python. Supporting arbitrary context features ☆847 Aug 22, 2019 Updated 6 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://… ☆2,391 Aug 26, 2021 Updated 4 years ago
- Data augmentation for NLP ☆4,658 Jun 24, 2024 Updated last year
- 100+ Chinese Word Vectors: over a hundred sets of pre-trained Chinese word vectors ☆12,221 Oct 30, 2023 Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆780 Apr 1, 2026 Updated last month
- Abydos NLP/IR library for Python ☆194 Nov 10, 2022 Updated 3 years ago
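- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods ☆2,613 May 13, 2024 Updated 2 years ago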
- An Open-Source Package for Neural Relation Extraction (NRE) ☆4,463 Jan 10, 2024 Updated 2 years ago
- Pre-trained Chinese ELECTRA ☆1,437 Apr 19, 2026 Updated last month
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,891 Apr 13, 2023 Updated 3 years ago