Text similarity computation with TF-IDF + Word2vec, best suited to long texts
☆24 · Dec 18, 2019 · Updated 6 years ago
Alternatives and similar repositories for TF-IDF-word2vec-Text-similarity-
Users that are interested in TF-IDF-word2vec-Text-similarity- are comparing it to the libraries listed below.
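- Function: given a newly entered text, compare its similarity against the existing data and return the top 10 texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, and cosine similarity. ☆11 · Jul 30, 2020 · Updated 5 years ago
- Chinese text preprocessing and Word2Vec training for computing text similarity. ☆43 · Mar 6, 2019 · Updated 7 years ago
- Chinese sentence embeddings with bert_avg, bert_whitening, sbert, consert, simcse, and esimcse ☆15 · Apr 7, 2022 · Updated 4 years ago
- Experiments with and comparison of four sentence/text similarity methods ☆292 · Sep 1, 2020 · Updated 5 years ago
- Three ways to compute TF-IDF: Python, sklearn, gensim ☆11 · Feb 26, 2019 · Updated 7 years ago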
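- A Chinese text summary generation model ☆21 · Jul 29, 2022 · Updated 3 years ago
- Semantics-based text similarity computation ☆10 · Jan 22, 2019 · Updated 7 years ago
- Person-job matching model, implemented with dssm and deepffm ☆11 · Jul 26, 2019 · Updated 6 years ago
- Generates training datasets for text detection ☆12 · Jul 1, 2020 · Updated 5 years ago
- "Cross-lingual Language Model Pretraining for Retrieval". (WWW 2021) ☆10 · Jun 17, 2022 · Updated 3 years ago
- knrm text similarity ☆10 · Aug 1, 2020 · Updated 5 years ago
- Baidu Migration Index and outflow destinations, at prefecture-level-city granularity nationwide ☆11 · May 30, 2020 · Updated 5 years ago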
- Open web page extractor and keyword extractor for Chinese web pages ☆20 · Aug 19, 2019 · Updated 6 years ago
- Using sklearn _ Cluster _ Kmeans ☆10 · Apr 11, 2018 · Updated 8 years ago
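- Summarization, keywords, key phrases, text similarity, word and sentence segmentation (a natural language processing toolkit) ☆11 · Aug 16, 2019 · Updated 6 years ago
- Trains an LDA (Latent Dirichlet Allocation) model with the gensim module to compute similarity between long and short texts. ☆12 · Nov 25, 2020 · Updated 5 years ago
- Train and evaluate on your own text similarity dataset using sentence-transformers (SBert). ☆49 · Sep 22, 2021 · Updated 4 years ago
- Topic analysis based on jieba word segmentation and an LDA model ☆19 · Apr 20, 2019 · Updated 7 years ago
- A deep-learning-based Chinese question answering system ☆10 · Feb 13, 2019 · Updated 7 years ago
- Deduplication of massive numbers of articles (long texts) based on simhash, Google's algorithm for large-scale web page deduplication. ☆11 · Dec 8, 2022 · Updated 3 years ago
- Video retrieval based on SG2300X [search video content in natural language and locate the specific time span that matches the description] ☆13 · Feb 29, 2024 · Updated 2 years ago
- Building a blog search system with Springboot + ElasticSearch ☆12 · Mar 5, 2020 · Updated 6 years ago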
- ☆17 · Sep 10, 2021 · Updated 4 years ago
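- Geolocation-based topic event mining on a Twitter corpus, built on clustering algorithms, the LDA topic model, and classifiers, with fine-grained sentiment analysis of the topic events ☆35 · Jul 29, 2018 · Updated 7 years ago
- Implement a small language model of your own ☆11 · Jun 15, 2024 · Updated last year
- Demonstration code for the algorithms covered in each chapter of the book 《自然语言理解与行业知识图谱-概念、方法与工程落地》 (Natural Language Understanding and Industry Knowledge Graphs: Concepts, Methods, and Engineering Practice) ☆13 · Jun 24, 2024 · Updated last year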
- ☆12 · May 3, 2024 · Updated 2 years ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg… ☆31 · Oct 10, 2025 · Updated 6 months ago
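- Computing text similarity with Doc2Vec ☆139 · Apr 11, 2018 · Updated 8 years ago
- This project implements several common NLP algorithms: keyword extraction, named entity recognition, automatic summarization (abstract), text similarity comparison, and more ☆16 · Jan 16, 2022 · Updated 4 years ago
- Offline Chinese annotation tool supporting NER, text classification, relation annotation, dialogue annotation, and more. ☆14 · Jul 29, 2022 · Updated 3 years ago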
- OntoEA: Ontology-guided Entity Alignment via Joint Knowledge Graph Embedding @ ACL'21 ☆25 · Nov 15, 2021 · Updated 4 years ago
- Zero-Shot Summarization with GPT-3 ☆17 · Sep 11, 2023 · Updated 2 years ago
- Performing Latent Semantic Analysis with Python on large datasets. ☆13 · Jun 21, 2022 · Updated 3 years ago
- Codes for Pretraining Language Models with Text-Attributed Heterogeneous Graphs ☆16 · Oct 13, 2023 · Updated 2 years ago
- The data used for the challenge consist of records from 12,000 ICU stays. ICU stays of less than 48 hours have been excluded. Up to 42 var… ☆13 · Jun 6, 2018 · Updated 7 years ago
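- A search engine website built on elasticsearch ☆14 · Nov 22, 2022 · Updated 3 years ago
- Computing Chinese text similarity with TF feature vectors and simhash fingerprints ☆217 · Aug 12, 2016 · Updated 9 years ago
- Novel reader: follow-list recommendations, leaderboard browsing, book search, category browsing, tag browsing, simulated page-turn effect, article reading, chapter caching, and more ☆39 · Dec 13, 2017 · Updated 8 years ago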