gospel303 / TF-IDF-word2vec-Text-similarity-
Text similarity computation with TF-IDF + Word2vec, best suited to long texts
☆24 Dec 18, 2019 Updated 6 years ago
Alternatives and similar repositories for TF-IDF-word2vec-Text-similarity-
Users that are interested in TF-IDF-word2vec-Text-similarity- are comparing it to the libraries listed below
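- Chinese text preprocessing and Word2Vec training for computing text similarity. ☆44 Mar 6, 2019 Updated 6 years ago
- Given a new input text, compares it against existing data and returns the top 10 most similar texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, and cosine similarity. ☆11 Jul 30, 2020 Updated 5 years ago
- Chinese text summarization model ☆21 Jul 29, 2022 Updated 3 years ago
- Experiments with and comparison of four sentence/text similarity methods ☆291 Sep 1, 2020 Updated 5 years ago
- GPS Tracker: a locator based on GPS and BeiDou, for scenarios that require location tracking, such as vehicles, pets, and the elderly ☆24 May 23, 2025 Updated 8 months ago
- Deep learning-based Chinese question answering system ☆10 Feb 13, 2019 Updated 7 years ago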
- Implement attention model to LSTM using TensorFlow ☆10 Jul 3, 2018 Updated 7 years ago
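- Implementing a small language model of your own ☆11 Jun 15, 2024 Updated last year
- Computing text similarity with Doc2Vec ☆139 Apr 11, 2018 Updated 7 years ago
- Video retrieval based on SG2300X [search video content in natural language and locate the specific time span that matches the description] ☆13 Feb 29, 2024 Updated last year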
- Yara rules ☆10 Dec 10, 2019 Updated 6 years ago
- [Translation] ApacheCN Android translation collection ☆11 Jan 11, 2022 Updated 4 years ago
- Person-job matching model, implemented with dssm and deepffm ☆11 Jul 26, 2019 Updated 6 years ago
- Three ways to compute TF-IDF: Python, sklearn, gensim ☆11 Feb 26, 2019 Updated 6 years ago
- Bootstrap Themeroller is an application that lets you customize the look and feel of Twitter's Bootstrap. It also provides a real time pr… ☆58 Aug 23, 2013 Updated 12 years ago
- A moment is eternity ☆13 Feb 26, 2020 Updated 5 years ago
- knrm text similarity ☆10 Aug 1, 2020 Updated 5 years ago
- fscan result optimization; updated DC domain filtering ☆10 Nov 21, 2023 Updated 2 years ago
- The Personal Finance Dashboard sample demonstrates the chart controls from the Ignite UI library acting together with grids, combo boxes … ☆12 Oct 31, 2023 Updated 2 years ago
- Text similarity algorithms ☆40 Nov 1, 2019 Updated 6 years ago
- 🔫 lkm module for emergency binary/script execution ☆12 Dec 22, 2017 Updated 8 years ago
- Deduplicates massive numbers of articles (long texts) using simhash, Google's algorithm for large-scale web page deduplication. ☆11 Dec 8, 2022 Updated 3 years ago
- Using sklearn _ Cluster _ Kmeans ☆10 Apr 11, 2018 Updated 7 years ago
- A stager and implant that executes remote Web Assembly ☆37 Feb 4, 2026 Updated last week
- Building a blog search system with Springboot + ElasticSearch ☆12 Mar 5, 2020 Updated 5 years ago
- Trains an LDA (Latent Dirichlet Allocation) model with the gensim module to compute similarity between long and short texts. ☆12 Nov 25, 2020 Updated 5 years ago
- This repo is for anonymized review. We will keep updating and optimizing this program. ☆15 Oct 18, 2024 Updated last year
- A firefox extension that blocks distracting websites to stay focused when you need to get things done ☆19 Sep 27, 2025 Updated 4 months ago
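- 影速 (a standalone subproject of GoComicMosaic) is a tool for testing the speed and stability of video collection APIs. It batch-tests multiple collection APIs for response time, success rate, and result count, then scores and ranks them overall. ☆28 Jun 17, 2025 Updated 8 months ago
- Baidu Migration Index and outbound destinations, at the granularity of all prefecture-level cities nationwide ☆10 May 30, 2020 Updated 5 years ago
- Train on your own text similarity dataset with sentence-transformers (SBert) and evaluate the result. ☆49 Sep 22, 2021 Updated 4 years ago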
- A template for creating Django applications that run on Docker Cloud ☆12 Jul 5, 2016 Updated 9 years ago
- Threat Detection Rules (Snort/Sigma/Yara) ☆14 Jan 23, 2024 Updated 2 years ago
- ☆12 May 3, 2024 Updated last year
- My master thesis in which we predict mortality on ICU ☆12 Oct 3, 2019 Updated 6 years ago
- This project contains implementations of several common NLP algorithms: keyword extraction, named entity recognition, automatic summarization, text similarity comparison, and others ☆16 Jan 16, 2022 Updated 4 years ago
- ☆30 Jan 16, 2026 Updated last month
- Text similarity algorithm based on Lucene, TF-IDF, and cosine similarity ☆12 Jul 25, 2018 Updated 7 years ago
- ☆15 Nov 22, 2023 Updated 2 years ago