A class containing commonly used algorithms for computing text similarity, such as longest common subsequence, minimum edit distance, and TF-IDF.
☆61 · Mar 28, 2017 · Updated 9 years ago
Alternatives and similar repositories for TextSimilarity
Users who are interested in TextSimilarity are comparing it to the libraries listed below.
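- Computes the similarity of Chinese texts using TF feature vectors and simhash fingerprints ☆217 · Aug 12, 2016 · Updated 9 years ago
- Various sentence similarity algorithms ☆36 · May 22, 2018 · Updated 8 years ago
- multiprocess unsupervised chinese_detect_words ngram_combination ☆23 · Jan 2, 2019 · Updated 7 years ago
- Computes text similarity based on TF-IDF and cosine similarity ☆36 · Aug 29, 2018 · Updated 7 years ago
- Chinese sentence similarity computation based on the gensim module ☆52 · Aug 1, 2018 · Updated 7 years ago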
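- Scrapes data from the National Bureau of Statistics ☆12 · May 4, 2016 · Updated 10 years ago
- Chinese sentence similarity computation based on siamese-lstm ☆129 · Jul 1, 2018 · Updated 8 years ago
- A small program for text duplicate checking ☆15 · Nov 19, 2018 · Updated 7 years ago
- ☆18 · Mar 7, 2022 · Updated 4 years ago
- textsum: a Seq2Seq-attention model and other strategy algorithms implemented in tensorflow for text summary tasks such as summary generation and main-idea extraction. Part of the code is adapted from other authors' code; all of it will be cleaned up and refactored later. ☆30 · Sep 19, 2019 · Updated 6 years ago
- ☆11 · Jan 21, 2019 · Updated 7 years ago
- Let's make Ease of Use in Emacs, Enjoy it! ☆12 · Oct 21, 2024 · Updated last year
- Implementation of the peer-to-peer simulation used for the experimental evaluation of the Heterogeneous Differential Privacy paper. ☆10 · Jul 5, 2020 · Updated 5 years ago
- Fast duplicate detection for massive Chinese text collections ☆18 · Dec 16, 2018 · Updated 7 years ago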
- A trie-based text editor with word suggestion. Built with python and pyqt ☆10 · Sep 7, 2016 · Updated 9 years ago
- Scripts to train a seq2seq model using tensorflow 2 ☆11 · Dec 9, 2019 · Updated 6 years ago
- Experiments with and comparison of four sentence/text similarity methods ☆292 · Sep 1, 2020 · Updated 5 years ago
- Computes text similarity using Doc2Vec ☆139 · Apr 11, 2018 · Updated 8 years ago
- Tensorflow implementation of a Neural Attention Model for Abstractive Summarization. ☆10 · Jul 20, 2020 · Updated 5 years ago
- Text-based spam SMS classification: text preprocessing ☆13 · Jan 11, 2016 · Updated 10 years ago
- Scrapes the Alibaba international site and saves the corresponding images to excel ☆13 · Jun 18, 2020 · Updated 6 years ago
- Chinese new word discovery ☆43 · Aug 30, 2024 · Updated last year
- Pytorch implementation of RNN, CNN, BiGRU and LSTM for text classification ☆10 · Apr 30, 2021 · Updated 5 years ago
- A web-based text correction system built on PaddleNLP; supports entering text or uploading Word documents, and displays and saves the corrected text. Tech stack: backend: PaddleNLP + FastAPI; frontend: Vue + Element UI ☆14 · May 18, 2022 · Updated 4 years ago
- Chest Xray Classifier using CNNs and Transfer Learning. The jupyter notebook of interest is titled 'Xrays_alt.ipynb' ☆11 · May 18, 2018 · Updated 8 years ago
- Unofficial implementation of the webformer: The Web-page Transformer for Structure Information Extraction ☆13 · Apr 20, 2023 · Updated 3 years ago
- This is the home directory of the speaker diarization module being developed for Heterogeneous News data in RedHen Labs as a GSOC Project ☆10 · Sep 11, 2015 · Updated 10 years ago
- t5-model-onnx, Chinese spelling correction. ☆15 · Sep 18, 2022 · Updated 3 years ago
- This project implements a Transformer model in Keras for text classification (both Chinese and English are supported). ☆12 · Mar 31, 2022 · Updated 4 years ago
- Predicting breast cancer at 97.51% accuracy with Naive Bayes Classifier for learning purposes. ☆13 · May 1, 2010 · Updated 16 years ago
- Breast cancer data mining with python sklearn ☆11 · Apr 13, 2019 · Updated 7 years ago
- Text feature extraction: segments text with jieba, derives feature values with the tf-idf algorithm, and provides a method for training the idf word-frequency file ☆21 · Feb 11, 2017 · Updated 9 years ago
- Chinese Text Generation using LSTM ☆11 · Aug 7, 2017 · Updated 8 years ago
- Chinese word segmentation using a BiLSTM concatenated with a CNN, plus CRF, with sample transfer ☆11 · Jun 11, 2019 · Updated 7 years ago
- IARM: Inter-Aspect Relation Modeling with Memory Networks in Aspect-Based Sentiment Analysis, EMNLP 2018 ☆48 · Feb 22, 2019 · Updated 7 years ago
- Health checkup system ☆23 · Dec 16, 2022 · Updated 3 years ago
- Distributed sentiment analysis on GitHub commit comments ☆10 · Jun 9, 2015 · Updated 11 years ago
- ☆45 · Sep 12, 2021 · Updated 4 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model ☆49 · May 2, 2021 · Updated 5 years ago