中文文本相似度计算器
☆170Oct 2, 2024Updated last year
Alternatives and similar repositories for xiangsi
Users that are interested in xiangsi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 经过强化的goose3通用网页提取器(添加作者VX: 862187570 , Python交流学习)☆16Nov 18, 2021Updated 4 years ago
- Code and data of the paper "MCTS: A Multi-Reference Chinese Text Simplification Dataset".☆33Jun 3, 2024Updated last year
- 根据关键词爬取微博内容并进行情感分析☆16Mar 18, 2020Updated 6 years ago
- 基于gensim模块的中文句子相似度计算☆52Aug 1, 2018Updated 7 years ago
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆39Mar 30, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Chinese Machine Reading 2021海华AI挑战赛·中文阅读理解·技术组·第三名☆20May 27, 2021Updated 4 years ago
- (已失效)自动生成知网期刊文献Bibtex并导入Zotero;自定义无csl文件的Zotero文献导出样式,在任何引用格式需求下实现随写随引;(已被Zotero6.0Beta实现)将所需知网文献批量、自动化导入Zotero。☆12Sep 26, 2024Updated last year
- 文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法☆2,613May 13, 2024Updated last year
- 基于Pytorch的Bert应用,包括命名实体识别、情感分析、文本分类以及文本相似度等☆819Jun 18, 2021Updated 4 years ago
- NLP预/后处理工具。☆30Mar 31, 2025Updated last year
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。☆4,963Feb 14, 2026Updated 2 months ago
- 复现了论文《基于主题模型的短文本关键词抽取及扩展》的代码☆31Nov 11, 2020Updated 5 years ago
- 利用Doc2Vec计算文本相似度☆139Apr 11, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 中文文本聚类☆123Jun 21, 2022Updated 3 years ago
- Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。☆902Mar 5, 2026Updated last month
- A simple GPT-3 interface to automate core legal writing tasks☆13Mar 8, 2023Updated 3 years ago
- The source code of the paper "Emotion-aware Chat Machine: Automatic Emotional Response Generation for Human-like Emotional Interaction"☆12Nov 12, 2020Updated 5 years ago
- Predicting treatment effects from RCTs (Circulation: CQO 2019).☆10Jun 21, 2022Updated 3 years ago
- 爬取披露易网站港股上市公司年报pdf文件☆14Jan 13, 2021Updated 5 years ago
- 深交所年报下载爬虫☆15Feb 10, 2021Updated 5 years ago
- 客户价值聚类分析☆15Feb 7, 2018Updated 8 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,881Mar 18, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple Transformers四种任务(分类、命名实体识别、机器阅读理解、语言模型微调)的代码样例,可以切换多种预训练模型。☆23Jun 7, 2022Updated 3 years ago
- ☆10Sep 9, 2024Updated last year
- 爬取巨潮资讯网,批量下载指定企业从2000年至今所有的年报pdf文件。☆21Apr 24, 2021Updated 5 years ago
- 用TF特征向量和simhash指纹计算中文文本的相似度☆217Aug 12, 2016Updated 9 years ago
- 微调预训练语言模型(BERT、Roberta、XLBert等),用于计算两个文本之间的相似度(通过句子对分类任务转换),适用于中文文本☆90Jul 30, 2020Updated 5 years ago
- 2步jackson快速替换fastjson☆12Mar 4, 2021Updated 5 years ago
- The crawler for data on web of science, especially focus on the analysis of citation data☆16Dec 14, 2018Updated 7 years ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- ☆14Aug 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Nested Named Entity Recognition for Chinese Biomedical Text☆12Jan 25, 2024Updated 2 years ago
- 根据关键词在 ScienceDirect 上批量爬取论文信息并翻译☆17Jan 10, 2018Updated 8 years ago
- 基于langchain和chatglm6b构建的智能问答系统,支持自定义语料☆10Jun 25, 2023Updated 2 years ago
- Developing a legal research tool leveraging ChatGPT / GPT-4☆14Mar 10, 2024Updated 2 years ago
- 基于百度LAC项目的PHP中文智能分词库☆10Jun 25, 2024Updated last year
- Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection☆10Jan 4, 2020Updated 6 years ago
- 使用不同的方法计算相似度☆42Dec 19, 2018Updated 7 years ago