xiaoshuwen1995 / Text-Similarity-MatchView external linksLinks
实现功能:新输入一段文本,与已有数据进行相似度进行比较,返回TOP10的文本。主要实现方法:jieba中文分词、gensim、TF-IDF词汇重要性、cosine余弦相似度。
☆11Jul 30, 2020Updated 5 years ago
Alternatives and similar repositories for Text-Similarity-Match
Users that are interested in Text-Similarity-Match are comparing it to the libraries listed below
Sorting:
- TF-IDF+Word2vec做文本相似度计算,最好是长文本☆24Dec 18, 2019Updated 6 years ago
- 基于TF-IDF和余弦定理计算文本相似度☆36Aug 29, 2018Updated 7 years ago
- A simple tutorial for converting CSV to RDF☆10Mar 30, 2016Updated 9 years ago
- Python3 实现的文章余弦相似度计算☆10Sep 28, 2017Updated 8 years ago
- 基于SG2300X的视频检索【使用自然语言搜索视频内容,定位到符合描述的具体时间段】☆13Feb 29, 2024Updated last year
- Solve water sort puzzle problem(Chinese name "水排序") using DFS and BFS☆14Mar 17, 2022Updated 3 years ago
- YxVM怎么样?YxVM介绍和测评☆10Feb 7, 2025Updated last year
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- 基于谷歌大规模网页去重simhash算法,对海量文章(长文本)进行去重。☆11Dec 8, 2022Updated 3 years ago
- 这是 Will 保哥在 2013 第 6 届 iT 邦帮忙铁人赛年度大奖的得奖着作【30 天精通 Git 版本控管】,欢迎大家 fork 我,如果有看见 任何文字勘误,也欢迎利用 pull request 来通知我修正,谢谢!☆12Jul 22, 2019Updated 6 years ago
- A large labeled corpus for Application Privacy Policy in Chinese to train named entity recognition models for Android Dangerous PERMSSION…☆11Jun 19, 2025Updated 7 months ago
- 2020腾讯广告算法大赛方案分享及代码(冠军)☆13May 1, 2023Updated 2 years ago
- Latent Drichlet Allocation and Dynamic Topic Modeling☆10Aug 11, 2021Updated 4 years ago
- 抓取淘女郎图片的简单爬虫,对应博文[python爬虫入门教程(三):淘女郎爬虫 ( 接口解析 | 图片下载 )](https://blog.csdn.net/aaronjny/article/details/80291997)。☆11May 13, 2018Updated 7 years ago
- Springboot + ElasticSearch 构建博客检索系统☆12Mar 5, 2020Updated 5 years ago
- lulu is the cutest!☆13Apr 5, 2021Updated 4 years ago
- Convolutional Neural Network for Text Classification in Tensorflow☆10Apr 3, 2017Updated 8 years ago
- 利用Bert获取中文字、词向量☆10Jan 18, 2022Updated 4 years ago
- Automatic missing value imputation using random forests☆14Aug 19, 2015Updated 10 years ago
- Code for the NLPCC 2018 paper: Distant Supervision for Relation Extraction with Neural Instance Selector☆12Mar 7, 2019Updated 6 years ago
- H&M商品推荐比赛(rank: 116/2952 )方案☆14Jun 16, 2022Updated 3 years ago
- Code for our paper: "Share First, Ask Later (or Never?) - Studying Violations of GDPR's Explicit Consent in Android Apps"☆11Oct 20, 2022Updated 3 years ago
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16…☆15Nov 13, 2020Updated 5 years ago
- ☆12May 3, 2024Updated last year
- Path and Trajectory Planning: (A* RRT), Simultaneous Localisation and Mapping: (EKF,FAST),[Source: Python Robotics] and Control Systems: …☆10Mar 27, 2022Updated 3 years ago
- 基于RASA官网搭建的智能聊天机器人☆12Feb 1, 2019Updated 7 years ago
- 本项目包含几种常用 NLP算法的实现:关键词(keyword)、命名实体(named entity)、自动摘要(abstract)、文本相似度比较(text similarity)等☆16Jan 16, 2022Updated 4 years ago
- ☆11Mar 26, 2020Updated 5 years ago
- Dynamic Topic Modelling Tutorial Files☆13May 12, 2015Updated 10 years ago
- 基于UICrawler开源工程,开发的针对android APP 自动化遍历工具,针对hybird或者纯native APP,做深度优先遍历。主要用于监听被抓取APP的页面是否有变动,并生成diff报告☆14Mar 1, 2019Updated 6 years ago
- Implementation of RRT* and Informed-RRT* on Turtlebot in ROS-Gazebo☆15May 28, 2020Updated 5 years ago
- Code and dataset for "Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction" (EMNLP'19)☆13May 18, 2020Updated 5 years ago
- 1,huaproject算福利吧,爬取的中国校花网,并且保存到本地,基础知识点,url,json,文件的读写. 2,Document.doc 是自己总结的常见爬虫面试题以及答案,但是貌似不想做全职爬虫,所以可能以后也不会更新这一块,爬虫算乐趣, 以后估计重心会放在web …☆14Jan 24, 2018Updated 8 years ago
- 一个基于elasticsearch开发的搜索引擎网站☆14Nov 22, 2022Updated 3 years ago
- 使用SpringBoot+MyBatis架构的大文件上传功能,支持断点续传☆15Oct 30, 2019Updated 6 years ago
- Attention-Based Convolutional Neural Network for Weakly Labeled Human Activities’ Recognition With Wearable Sensors☆12Jul 21, 2020Updated 5 years ago
- 简单状态机实现。同时以简化的订单状态机为例子进行了说明。☆15Oct 13, 2020Updated 5 years ago
- 基于 appium , Android APP UI遍历测试 , Python3.x☆13Nov 13, 2017Updated 8 years ago
- GOAT(山羊)是中英文大语言模型,基于LlaMa进行SFT。☆12Apr 24, 2023Updated 2 years ago