这是一个类,里面包含的有关文本相似度的常用的计算算法,例如,最长公共子序列,最短标记距离,TF-IDF等算法
☆63Mar 28, 2017Updated 9 years ago
Alternatives and similar repositories for TextSimilarity
Users that are interested in TextSimilarity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python implementaion of optimal string alignment algorithm of SparseDamerauLevenshteinAutomaton for string fuzzy match.☆13Aug 11, 2016Updated 9 years ago
- 多种句子相似度算法☆36May 22, 2018Updated 8 years ago
- multiprocess unsupervised chinese_detect_words ngram_combination☆23Jan 2, 2019Updated 7 years ago
- ☆16Jun 18, 2022Updated 3 years ago
- 多进程分段读取大文件,并统计词频☆10Jan 14, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 基于TF-IDF和余弦定理计算文本相似度☆36Aug 29, 2018Updated 7 years ago
- 基于gensim模块的中文句子相似度计算☆52Aug 1, 2018Updated 7 years ago
- 抓取国家统计局数据☆13May 4, 2016Updated 10 years ago
- 基于siamese-lstm的中文句子相似度计算☆129Jul 1, 2018Updated 7 years ago
- 文本查重小程序☆15Nov 19, 2018Updated 7 years ago
- ☆18Mar 7, 2022Updated 4 years ago
- A media decoding library based on MoviePy☆10Jun 16, 2025Updated 11 months ago
- Python analytic hierarchy process☆23Oct 3, 2023Updated 2 years ago
- textsum基于tensorflow实现的Seq2Seq-attention模型以及其他策略算法, 来解决摘要生成、主旨提取等(Text Summary)的任务。部分代码是在其他作者代码的基础上修改而来,后期将全部整理重构。☆30Sep 19, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 企业微信提供的获取会话记录数据 sdk 的 Python 绑定 https://open.work.weixin.qq.com/api/doc/90000/90135/91774☆13Sep 14, 2021Updated 4 years ago
- Implementation of the peer-to-peer simulation used for the experimental evaluation of the Heterogeneous Differential Privacy paper.☆10Jul 5, 2020Updated 5 years ago
- Linux driver for tplink-wn725n nano wireless adapter.☆10Apr 7, 2013Updated 13 years ago
- 海量中文文本快速查重☆18Dec 16, 2018Updated 7 years ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- Scripts to train a seq2seq model using tensorflow 2☆11Dec 9, 2019Updated 6 years ago
- 对四种句子/文本相似度计算方法进行实验与比较☆292Sep 1, 2020Updated 5 years ago
- 基于语义的文本相似度计算☆10Jan 22, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 使用不同的方法计算相似度☆42Dec 19, 2018Updated 7 years ago
- 利用Doc2Vec计算文本相似度☆139Apr 11, 2018Updated 8 years ago
- 暴恐事件自动检测☆13Mar 3, 2017Updated 9 years ago
- Tensorflow implementation of a Neural Attention Model for Abstractive Summarization.☆10Jul 20, 2020Updated 5 years ago
- 基于文本的垃圾短信分类_文本预处理☆13Jan 11, 2016Updated 10 years ago
- Chinese new word discovery☆43Aug 30, 2024Updated last year
- Pytorch implementation of RNN, CNN, BiGRU and LSTM for text classifcation☆10Apr 30, 2021Updated 5 years ago
- 基于PaddleNLP的web端文本纠错系统,支持输入文本或上传word文档,显示纠错后文本结果与保存。 技术栈:后端:PaddleNLP +FastAPI;前端:Vue+Element UI☆14May 18, 2022Updated 4 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- t5-model-onnx,中文拼写纠错,Chinese spelling correction。☆15Sep 18, 2022Updated 3 years ago
- 本项目使用Keras实现Transformer模型来进行文本分类(中文、英文均支持)。☆12Mar 31, 2022Updated 4 years ago
- Predicting breast cancer at 97.51% accuracy with Naive Bayes Classifier for learning purposes.☆13May 1, 2010Updated 16 years ago
- There are some reproduced algorithms for learning from imbalanced data, including over-sampling,under-sampling and boosting☆13Jul 30, 2023Updated 2 years ago
- Chinese Text Generation using LSTM☆11Aug 7, 2017Updated 8 years ago
- bert语言模型校验句子的通顺性☆15Aug 17, 2020Updated 5 years ago
- Distributed sentiment analysis on GitHub commit comments☆10Jun 9, 2015Updated 10 years ago