Text similarity computation with TF-IDF + Word2vec, best suited to long texts
☆24 · Dec 18, 2019 · Updated 6 years ago
Alternatives and similar repositories for TF-IDF-word2vec-Text-similarity-
Users that are interested in TF-IDF-word2vec-Text-similarity- are comparing it to the libraries listed below
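- Chinese text preprocessing and Word2Vec training for computing text similarity. ☆44 · Mar 6, 2019 · Updated 7 years ago
- Function: compares a newly input text against existing data by similarity and returns the top 10 texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term importance, cosine similarity. ☆11 · Jul 30, 2020 · Updated 5 years ago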
- bert_avg, bert_whitening, sbert, consert, simcse, esimcse: Chinese sentence vector representations ☆16 · Apr 7, 2022 · Updated 3 years ago
- Experiments with and a comparison of four sentence/text similarity methods ☆291 · Sep 1, 2020 · Updated 5 years ago
- "Cross-lingual Language Model Pretraining for Retrieval". (WWW 2021) ☆10 · Jun 17, 2022 · Updated 3 years ago
- Deep-learning-based Chinese question answering system ☆10 · Feb 13, 2019 · Updated 7 years ago
- Implementing a small language model of your own ☆11 · Jun 15, 2024 · Updated last year
- Implement attention model to LSTM using TensorFlow ☆10 · Jul 3, 2018 · Updated 7 years ago
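- [Translation] ApacheCN Android translation collection ☆11 · Jan 11, 2022 · Updated 4 years ago
- Simple PyTorch implementations of vector representation methods for Chinese text (Sentence-BERT, CoSENT), usable for text similarity computation. ☆10 · Mar 27, 2022 · Updated 3 years ago
- knrm text similarity ☆10 · Aug 1, 2020 · Updated 5 years ago
- SG2300X-based video retrieval [search video content with natural language and locate the specific time span that matches the description] ☆13 · Feb 29, 2024 · Updated 2 years ago
- An instant is eternity ☆13 · Feb 26, 2020 · Updated 6 years ago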
- Bootstrap Themeroller is an application that lets you customize the look and feel of Twitter's Bootstrap. It also provides a real time pr… ☆58 · Aug 23, 2013 · Updated 12 years ago
- Person-job matching model, implemented with the dssm method and deepffm ☆11 · Jul 26, 2019 · Updated 6 years ago
- WinDbg plugin to trace module transitions from a debugged driver. ☆44 · Dec 22, 2025 · Updated 2 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 · Jun 28, 2024 · Updated last year
- The Personal Finance Dashboard sample demonstrates the chart controls from the Ignite UI library acting together with grids, combo boxes … ☆12 · Oct 31, 2023 · Updated 2 years ago
- Text similarity algorithms ☆40 · Nov 1, 2019 · Updated 6 years ago
- SQL injection detection engine by tokenizing and syntax analysis, like SQLChop ☆10 · May 8, 2017 · Updated 8 years ago
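- Finding vulnerable assets ☆11 · Jun 28, 2024 · Updated last year
- Building a blog search system with Springboot + ElasticSearch ☆12 · Mar 5, 2020 · Updated 6 years ago
- Trains an LDA (Latent Dirichlet Allocation) model with the gensim module to compute the similarity of long and short texts. ☆12 · Nov 25, 2020 · Updated 5 years ago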
- Burpsuite Extension for Jsmon ☆22 · Feb 5, 2026 · Updated last month
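- Deduplicates massive numbers of articles (long texts) using simhash, Google's algorithm for large-scale web page deduplication. ☆11 · Dec 8, 2022 · Updated 3 years ago
- Semantics-based text similarity computation ☆10 · Jan 22, 2019 · Updated 7 years ago
- Baidu migration index and outflow destinations, at the granularity of every prefecture-level city nationwide ☆11 · May 30, 2020 · Updated 5 years ago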
- A stager and implant that executes remote Web Assembly ☆37 · Feb 4, 2026 · Updated last month
- A template for creating Django applications that run on Docker Cloud ☆12 · Jul 5, 2016 · Updated 9 years ago
- [Anti-Forensics, Steganography, Data Exfiltration] Encrypt a file and hide it in any PDF. ☆12 · Jun 8, 2017 · Updated 8 years ago
- Firewall for VoIP systems ☆11 · Jul 23, 2020 · Updated 5 years ago
- Create KeyTab PowerShell Script ☆16 · Nov 3, 2020 · Updated 5 years ago
- My master thesis in which we predict mortality on ICU ☆12 · Oct 3, 2019 · Updated 6 years ago
- Yara rules for malicious javascript files from public repositories or written by me. ☆13 · Nov 12, 2021 · Updated 4 years ago
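- Source code accompanying the book 《Tensorflow+Keras深度学习人工智能实践应用》: the code I typed out for each chapter, plus the required data files ☆14 · Oct 28, 2019 · Updated 6 years ago
- Global AI Technology Innovation Competition, Track 3: semantic matching of short dialogue texts for the Xiaobu assistant ☆12 · Apr 5, 2021 · Updated 4 years ago
- Java-related notes ☆12 · Apr 8, 2021 · Updated 4 years ago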
- The data used for the challenge consist of records from 12,000 ICU stays. ICU stays of less than 48 hours have been excluded. Up to 42 var… ☆13 · Jun 6, 2018 · Updated 7 years ago
- Employee attrition prediction practice competition ☆10 · Aug 25, 2017 · Updated 8 years ago