用TF特征向量和simhash指纹计算中文文本的相似度
☆217Aug 12, 2016Updated 9 years ago
Alternatives and similar repositories for text-similarity
Users that are interested in text-similarity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这是一个类,里面包含的有关文本相似度的常用的计算算法,例如,最长公共子序列,最短标记距离,TF-IDF等算法☆63Mar 28, 2017Updated 9 years ago
- 利用Doc2Vec计算文本相似度☆139Apr 11, 2018Updated 7 years ago
- 中文文本语义相似度(Chinese Semantic Text Similarity)语 料库建设☆480Mar 7, 2018Updated 8 years ago
- 《知网》中文词语语义相似度算法☆41Jun 6, 2013Updated 12 years ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- self complement of Sentence Similarity compute based on cilin, hownet, simhash, wordvector,vsm models,基于同义词词林,知网,指纹,字词向量,向量空间模型的句子相似度计算。☆365Dec 15, 2018Updated 7 years ago
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- 使用simhash算法,快速索引和查询大量文本简历☆21Dec 16, 2015Updated 10 years ago
- A Python Implementation of Simhash Algorithm☆1,036Mar 24, 2022Updated 4 years ago
- 使用不同的方法计算相似度☆42Dec 19, 2018Updated 7 years ago
- simhash算法实现海量内容查重☆14Apr 23, 2016Updated 9 years ago
- 中文文档simhash值计算☆1,167Updated this week
- 对四种句子/文本相似度计算方法进行实验与比较☆291Sep 1, 2020Updated 5 years ago
- Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings☆1,418May 6, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆14Apr 26, 2025Updated 11 months ago
- Hello world demonstration for Weblate☆14Jan 20, 2026Updated 2 months ago
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆22Oct 24, 2024Updated last year
- TF-IDF+Word2vec做文本相似度计算,最好是长文本☆24Dec 18, 2019Updated 6 years ago
- This is a AUTOSAR documents specific retriever based on LLM and RAG.☆16Nov 12, 2024Updated last year
- 基于siamese-lstm的中文句子相似度计算☆129Jul 1, 2018Updated 7 years ago
- 文本相似性计算☆28May 30, 2016Updated 9 years ago
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆12Dec 3, 2023Updated 2 years ago
- Text-Similarity Method in Pytorch☆469Dec 9, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- NLP 以及相关的学习实践☆40Apr 26, 2022Updated 3 years ago
- Calculate similarity between documents using TF-IDF weights☆116Nov 27, 2024Updated last year
- 文本相似度计算/文本匹配☆309Feb 8, 2020Updated 6 years ago
- 用于比较两个中文句子相似度的工具☆29Jul 11, 2018Updated 7 years ago
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles☆79Aug 7, 2009Updated 16 years ago
- ⛔️ DEPRECATED ~~ GitHub action lint with Vale ✅❎ ~~ DEPRECATED ⛔️☆12Apr 14, 2020Updated 5 years ago
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others).☆16Apr 3, 2012Updated 13 years ago
- The official repo is now at http://github.com/couchapp/couchapp☆161Feb 20, 2010Updated 16 years ago
- ☆19Dec 2, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A lecture I gave at PyData NYC 2012 on using the networkx python library and Gephi to generate a mapping of the python community on Twitt…☆28Dec 6, 2012Updated 13 years ago
- GitHub page for the TextBundle Markdown/text specification☆24Jul 30, 2014Updated 11 years ago
- ☆19Jun 5, 2023Updated 2 years ago
- Course Materials for Bayesian Psychometric Modeling☆15May 14, 2019Updated 6 years ago
- A proselint linter for use with Phabricator's arc command line tool.☆17Jun 17, 2016Updated 9 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 2 months ago
- A bot to add citation data from OpenCitations to Wikidata☆12May 23, 2023Updated 2 years ago