An efficient algorithm for text similarity computation
☆60Apr 26, 2021Updated 4 years ago
Alternatives and similar repositories for simhash
Users that are interested in simhash are comparing it to the libraries listed below.
- Text retrieval database based on simhash similarity search☆25Mar 27, 2023Updated 2 years ago
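- Checks the similarity of lab report contents. The reports are Word documents with a .doc or .docx extension. Detection uses the simhash algorithm.☆13May 24, 2018Updated 7 years ago
- Extracts the title, time, and body text from news content pages, with no configuration required☆18Aug 19, 2016Updated 9 years ago
- opencart2.0 Chinese language pack: simplified registration, Alipay, adapted to conditions in China☆10Nov 14, 2014Updated 11 years ago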
- ☆23Nov 5, 2017Updated 8 years ago
- ☆17Apr 17, 2013Updated 12 years ago
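- Single-machine Java implementation of Simhash☆116May 20, 2022Updated 3 years ago
- The topic sentence extraction method based on title classification can be described as follows: given a news report, compute the similarity between the title and the set of news topic words to decide whether the title is indicative. For an indicative title, extract the sentence in the report most similar to it as the topic sentence; otherwise, combine multiple features to score the importance of each sentence in the report and take the highest-scoring sentence as the topic sentence.☆40Jul 26, 2016Updated 9 years ago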
- Frontend application of a Headless blog using Strapi as CMS☆10Mar 17, 2026Updated last week
- Simhash value computation for Chinese documents☆1,168Mar 13, 2026Updated last week
- ☆22Jan 8, 2019Updated 7 years ago
- ☆61Jul 19, 2024Updated last year
- Amazon like e-commerce Dapp with open sales, customized stores, auctions system and product reviews built on the EVM blockchains☆19Jun 5, 2023Updated 2 years ago
- Load Tensorflow pb file using Bert/TextCNNs, an ensemble model using Java.☆11Aug 20, 2021Updated 4 years ago
- A search engine for information security articles, built on a distributed crawler☆27May 22, 2023Updated 2 years ago
- Deduplicates massive collections of articles (long texts) based on Google's simhash algorithm for large-scale web page deduplication.☆11Dec 8, 2022Updated 3 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- a demo for how to execute bert_base_chinese based model in java☆10Mar 8, 2019Updated 7 years ago
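- Keyword extraction algorithm for CSDN blogs that combines multiple features such as TF, IDF, part of speech, and position. The project was entered in the 2017 SMP user profiling evaluation and ranked fourth, with a precision of 59.9% on the validation set and 58.7% on the final set. A heuristic method with good generality.☆30Dec 13, 2017Updated 8 years ago
- Functionality: given a newly entered piece of text, compares its similarity against existing data and returns the top 10 texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term importance, and cosine similarity.☆11Jul 30, 2020Updated 5 years ago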
- CVE-2024-30056 Microsoft Edge (Chromium-based) Information Disclosure Vulnerability☆17May 27, 2024Updated last year
- ☆12May 3, 2024Updated last year
- ☆13Jun 1, 2020Updated 5 years ago
- This project contains implementations of several common NLP algorithms: keyword extraction, named entity recognition, automatic summarization, text similarity comparison, and others☆16Jan 16, 2022Updated 4 years ago
- Flash-sale system for e-commerce transactions☆10Jul 26, 2017Updated 8 years ago
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.☆10Jul 27, 2018Updated 7 years ago
- Vue 3.x, Ant Design Vue 2.x, internationalization, router, UI modeled on antd-admin-pro, dynamic routing, dynamic menu permissions, page state caching☆12Feb 4, 2021Updated 5 years ago
- Website of the Floripa+ organization - built with Next.js and integrated with Strapi CMS☆18Mar 17, 2021Updated 5 years ago
- Performing Latent Semantic Analysis with Python on large datasets.☆13Jun 21, 2022Updated 3 years ago
- A small set of weather related icons☆27Mar 3, 2019Updated 7 years ago
- A search engine website built on elasticsearch☆14Nov 22, 2022Updated 3 years ago
- ☆12Mar 1, 2019Updated 7 years ago
- iOS bar chart with support for multiple Y axes☆24Mar 21, 2016Updated 10 years ago
- A Python module for extracting relevant tags from text documents.☆17May 13, 2011Updated 14 years ago
- [ICADL] Named entity recognition architecture combining contextual and global features☆13Dec 14, 2021Updated 4 years ago
- Transparent file filter driver☆15Mar 31, 2013Updated 12 years ago
- Uses Bert to obtain Chinese character and word vectors☆10Jan 18, 2022Updated 4 years ago
- content-based recommendation system using numpy and scipy☆11Jan 30, 2017Updated 9 years ago
- ☆11Jul 31, 2018Updated 7 years ago