xiaoshuwen1995/Text-Similarity-Match

xiaoshuwen1995 / Text-Similarity-Match

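Functionality: given a newly input text, compare it against the existing data by similarity and return the top 10 most similar texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, and cosine similarity.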
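The pipeline described above (tokenize, weight terms with TF-IDF, rank by cosine similarity, return the top matches) can be sketched roughly as follows. This is not the repository's code: it is a minimal standard-library illustration that assumes a whitespace tokenizer in place of jieba and a hand-written smoothed TF-IDF in place of gensim. All function names and the sample corpus are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Stand-in tokenizer (assumption): lowercase + whitespace split.
    # For Chinese text, a segmenter such as jieba would be used here instead.
    return text.lower().split()


def tfidf_vector(tokens, idf):
    """Sparse TF-IDF vector (dict) scaled to unit length; unseen terms are dropped."""
    tf = Counter(t for t in tokens if t in idf)
    vec = {t: count * idf[t] for t, count in tf.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def build_index(docs):
    """Compute IDF weights and unit-length TF-IDF vectors for the corpus."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an illustrative choice, not necessarily gensim's formula).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [tfidf_vector(tokens, idf) for tokens in tokenized]
    return idf, vectors


def cosine(a, b):
    # Both vectors are unit length, so the dot product equals the cosine similarity.
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def top_k_similar(query, docs, idf, vectors, k=10):
    """Return the k corpus texts most similar to the query, with their scores."""
    q = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # best score first
    return [(docs[i], score) for score, i in scored[:k]]


corpus = [
    "the cat sat on the mat",
    "dogs and cats are common pets",
    "stock markets fell sharply today",
    "the cat chased the mouse",
]
idf, vectors = build_index(corpus)
results = top_k_similar("a cat on a mat", corpus, idf, vectors, k=2)
for text, score in results:
    print(f"{score:.3f}  {text}")
```

With `k=10` and a larger corpus, the same call would return the ten closest texts, which matches the behaviour the description outlines.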

☆11

Alternatives and similar repositories for Text-Similarity-Match

Users who are interested in Text-Similarity-Match are comparing it to the repositories listed below.

gospel303 / TF-IDF-word2vec-Text-similarity-
Text similarity computation with TF-IDF + Word2vec; works best on long texts.
☆24 · Dec 18, 2019 · Updated 6 years ago
nigestream / cosSim
Text similarity computation based on TF-IDF and cosine similarity.
☆36 · Aug 29, 2018 · Updated 7 years ago
SnailDM / git-test
☆16 · Jul 2, 2019 · Updated 7 years ago
leebird / bionlp17
Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction
☆11 · Oct 25, 2017 · Updated 8 years ago
iamxiatian / x-extractor
Open web page extractor and keyword extractor for Chinese web pages
☆20 · Aug 19, 2019 · Updated 6 years ago
laihuiyuan / atec2018-nlp
Ant Financial: Financial Brain, Financial Intelligence NLP Competition (26th/2632)
☆15 · Apr 8, 2019 · Updated 7 years ago
ShenaoW / APPCH
A large labeled corpus for Application Privacy Policy in Chinese to train named entity recognition models for Android Dangerous PERMISSION…
☆11 · Jun 19, 2025 · Updated last year
longzhen123 / RippleNet
Implementation of RippleNet, a knowledge-graph-based recommendation algorithm.
☆18 · Nov 29, 2022 · Updated 3 years ago
YuxiXie / OpenNQG
☆11 · Mar 26, 2020 · Updated 6 years ago
crazyyanchao / TextSummary
NLP | Automatic text summarization | Hot word discovery | New word discovery
☆18 · Apr 28, 2020 · Updated 6 years ago
Hello-MLClub / Tencent2020_ad
Solution write-up and code for the 2020 Tencent Advertising Algorithm Competition (champion).
☆14 · May 1, 2023 · Updated 3 years ago
ybch14 / RelationExtraction-NIS-PyTorch
Code for the NLPCC 2018 paper: Distant Supervision for Relation Extraction with Neural Instance Selector
☆12 · Mar 7, 2019 · Updated 7 years ago
ZhangYiBo513 / Simhash-
Deduplicates massive collections of articles (long texts) using simhash, the algorithm Google uses for large-scale web page deduplication.
☆11 · Dec 8, 2022 · Updated 3 years ago
DannyLee1991 / article_cosine_similarity
Article cosine similarity computation implemented in Python 3.
☆10 · Sep 28, 2017 · Updated 8 years ago
handsomecui / chat-robot
An intelligent chatbot built following the official RASA website.
☆12 · Feb 1, 2019 · Updated 7 years ago
ZillaRU / VideoSearch-tpu
Video retrieval on the SG2300X [search video content with natural language and locate the specific time span that matches the description].
☆13 · Feb 29, 2024 · Updated 2 years ago
willshi2023 / springboot-es
Building a blog search system with Springboot + ElasticSearch.
☆12 · Mar 5, 2020 · Updated 6 years ago
AaronJny / mm_taobao
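A simple crawler that scrapes Tao Girl (淘女郎) images; accompanies the blog post [Python crawler beginner tutorial (3): Tao Girl crawler (API parsing | image download)](https://blog.csdn.net/aaronjny/article/details/80291997).
☆11 · May 13, 2018 · Updated 8 years ago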
crabdriver / Python-Spark-2.0-Hadoop-
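This is the companion code for the corresponding book; because the download address given in the book is unreliable, this repository serves as a supplement.
☆16 · Aug 12, 2021 · Updated 4 years ago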
rxt2012kc / cnn-text-classification-tf
Convolutional Neural Network for Text Classification in Tensorflow
☆10 · Apr 3, 2017 · Updated 9 years ago
tars-sh / chatgpt-for-english-learning
Let ChatGPT help you learn English in an innovative way
☆14 · Feb 9, 2023 · Updated 3 years ago
nakul3112 / Motion_Planning_with_RRTstar_and_InformedRRTstar
Implementation of RRT* and Informed-RRT* on Turtlebot in ROS-Gazebo
☆15 · May 28, 2020 · Updated 6 years ago
xiaobeibeinihao / UICrawler
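An automated traversal tool for Android apps, built on the open-source UICrawler project. It performs depth-first traversal of hybrid or pure native apps, and is mainly used to monitor whether the pages of the crawled app have changed and to generate a diff report.
☆14 · Mar 1, 2019 · Updated 7 years ago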
yanchaoni / translate_machine_translation
Vietnamese and Chinese to English
☆15 · Dec 17, 2018 · Updated 7 years ago
JIMHUANG1024 / H-M-Personalized-Fashion-Recommendations
Solution for the H&M product recommendation competition (rank: 116/2952).
☆15 · Jun 16, 2022 · Updated 4 years ago
sunlab-osu / REDS2
Code and dataset for "Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction" (EMNLP'19)
☆13 · May 18, 2020 · Updated 6 years ago
karanchawla / motion-planning-playground
Playground for motion planning and controls algorithms.
☆15 · Aug 15, 2018 · Updated 7 years ago
KennCoder7 / Attention-for-HAR
Attention-Based Convolutional Neural Network for Weakly Labeled Human Activities’ Recognition With Wearable Sensors
☆12 · Jul 21, 2020 · Updated 6 years ago
guannan-he / pathPlanningPaperAndCodes
lulu is the cutest!
☆13 · Apr 5, 2021 · Updated 5 years ago
catid / minigpt4
MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code
☆15 · Jun 12, 2023 · Updated 3 years ago
CrossmodalGroup / ESL
☆12 · May 3, 2024 · Updated 2 years ago
zhangzhao4444 / automonkey
Appium-based UI traversal testing for Android apps, Python 3.x.
☆13 · Nov 13, 2017 · Updated 8 years ago
Daphnis-z / nlp-ztools
This project contains implementations of several common NLP algorithms: keyword extraction, named entity recognition, automatic summarization, text similarity comparison, and more.
☆16 · Jan 16, 2022 · Updated 4 years ago
dssg / Random_Forest_Imputer
Automatic missing value imputation using random forests
☆14 · Aug 19, 2015 · Updated 10 years ago
Automanmm / Spark-ALS-CF
Spark in practice: personalized music recommendation with collaborative filtering based on ALS matrix factorization.
☆21 · Apr 15, 2019 · Updated 7 years ago
reactive-systems / ml2
Machine Learning for Mathematics and Logics
☆16 · Apr 1, 2025 · Updated last year
TheDataLeek / Python-LSA
Performing Latent Semantic Analysis with Python on large datasets.
☆13 · Jun 21, 2022 · Updated 4 years ago
m94h / dtm_gensim
Dynamic Topic Modelling Tutorial Files
☆14 · May 12, 2015 · Updated 11 years ago
kikizxd / Class-imbalanced__Credit-Card-Fraud
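Class imbalance in classification. Remedies: sampling (SMOTE, ensemble techniques, etc.), threshold moving, and adjusting costs or weights; includes a credit card fraud case study.
☆21 · Oct 8, 2019 · Updated 6 years ago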