shibing624/pke_zh

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shibing624/pke_zh)

shibing624 / pke_zh

pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具，实现了KeyBert、PositionRank、TopicRank、TextRank等算法，开箱即用。

☆216

Alternatives and similar repositories for pke_zh

Users that are interested in pke_zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JackHCC / Chinese-Keyphrase-Extraction
View on GitHub
无监督中文关键词抽取（Keyphrase Extraction），基于统计，基于图【LDA与PageRank（TextRank， TPR， Salience Rank， Single TPR等）】，基于嵌入【SIFRank等】，开箱即用！
☆109Jun 20, 2022Updated 4 years ago
taishan1994 / chinese_keyword_extraction
View on GitHub
中文关键词提取
☆14Aug 7, 2023Updated 2 years ago
deepdialog / ZhKeyBERT
View on GitHub
Minimal keyword extraction with BERT
☆88Nov 17, 2021Updated 4 years ago
shibing624 / pinyin-tokenizer
View on GitHub
pinyintokenizer, 拼音分词器，将连续的拼音切分为单字拼音列表。
☆31Feb 5, 2025Updated last year
shibing624 / pytextclassifier
View on GitHub
pytextclassifier is a toolkit for text classification. 文本分类，LR，Xgboost，TextCNN，FastText，TextRNN，BERT等分类模型实现，开箱即用。
☆524Sep 25, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shibing624 / nerpy
View on GitHub
🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具，支持BertSoftmax、BertSpan等模型，开箱即用。
☆118Feb 19, 2024Updated 2 years ago
pkunlp-icler / GAIN
View on GitHub
Source code for EMNLP 2020 paper: Double Graph Based Reasoning for Document-level Relation Extraction
☆16Nov 20, 2020Updated 5 years ago
shibing624 / similarities
View on GitHub
Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包，支持亿级数据文搜文、文搜图、图搜图，python3开发，开箱即用。
☆903Mar 5, 2026Updated 4 months ago
shibing624 / text2vec
View on GitHub
text2vec, text to vector. 文本向量表征工具，把文本转化为向量矩阵，实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型，开箱即用。
☆4,971Feb 14, 2026Updated 4 months ago
BarryZM / CCKS2020Task4-3rd
View on GitHub
面向金融领域的小样本跨类迁移事件抽取第三名方案及代码
☆16Dec 23, 2020Updated 5 years ago
wjx-git / KeyWordsExtraction
View on GitHub
中文短文本关键词抽取
☆12Nov 29, 2021Updated 4 years ago
THUKElab / CCL2023-CLTC-THU_KELab
View on GitHub
This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…
☆15Nov 25, 2023Updated 2 years ago
dongrixinyu / JioNLP
View on GitHub
中文 NLP 预处理、解析工具包，准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
☆3,850Jun 5, 2026Updated last month
dongrixinyu / chinese_keyphrase_extractor
View on GitHub
An off-the-shelf tool for Chinese Keyphrase Extraction 一个快速从中文里抽取关键短语的工具，仅占35M内存 www.jionlp.com
☆554Nov 21, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
banifeng / keyword-extractor
View on GitHub
关键词抽取项目
☆24Sep 29, 2020Updated 5 years ago
wangyuxinwhy / uniem
View on GitHub
unified embedding model
☆877Sep 1, 2023Updated 2 years ago
hrwise-nlp / Cue-CoT
View on GitHub
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]
☆24Nov 18, 2023Updated 2 years ago
shibing624 / case-analysis
View on GitHub
NLP之病历分析：从病历文本之中提取关键信息，便于后续分析处理。
☆22Feb 12, 2017Updated 9 years ago
zhangyi24 / sentence_transformer_zh
View on GitHub
☆32May 30, 2021Updated 5 years ago
LinhanZ / mderank
View on GitHub
This is code for paper: MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction
☆66Nov 10, 2022Updated 3 years ago
aysent / supervised-term-weighting
View on GitHub
Supervised Term Weighting Schemes for Text Classification
☆19Feb 25, 2019Updated 7 years ago
EdisonLeeeee / DCIC-2023-Solution
View on GitHub
DCIC2023 Fraud Risk Identification Competition Solution.
☆26Mar 30, 2023Updated 3 years ago
catqaq / OpenTextClassification
View on GitHub
OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全…
☆212May 3, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
skykiseki / textrank4ch
View on GitHub
基于Textrank的关键字提取 & 摘要提取
☆18Sep 15, 2023Updated 2 years ago
AdeDZY / DeepCT
View on GitHub
DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.
☆325May 9, 2021Updated 5 years ago
shibing624 / dialogbot
View on GitHub
dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人，基于问答型对话、任务型对话、聊天型对话等模型实现，支持网络检索问答，领域知识…
☆329Apr 23, 2024Updated 2 years ago
ZhuiyiTechnology / simbert
View on GitHub
a bert for retrieval and generation
☆860Feb 26, 2021Updated 5 years ago
shibing624 / pycorrector
View on GitHub
pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。
☆6,483Jun 4, 2026Updated last month
uclanlp / DeepKPG
View on GitHub
Deep Keyphrase Generation with Pre-trained Language Models
☆29Feb 23, 2024Updated 2 years ago
svjack / tableQA-Chinese
View on GitHub
Unsupervised tableQA and databaseQA on chinese finance question and tabular data
☆13Apr 20, 2023Updated 3 years ago
dbiir / UER-py
View on GitHub
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,109May 9, 2024Updated 2 years ago
MaartenGr / KeyBERT
View on GitHub
Minimal keyword extraction with BERT
☆4,198May 13, 2026Updated last month
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
laomagic / TextClassifier
View on GitHub
THUCNews中文文本分类数据集，该数据集包含84万篇新闻文档，总计14类；在该模型的基础上测试多个版本bert分类效果。
☆73Feb 2, 2021Updated 5 years ago
shengtaovvv / Dialogue
View on GitHub
本项目由三个模块构成。意图识别：判断用户的意图是业务型还是闲聊型；模型检索：该部分构建一个语料库，当用户发起新的query（通过意图识别判断为业务型对话）时，为用户匹配query检索的最佳response，使用HSWN进行召回（粗排），然后构建句子的相似度，并利用Lig…
☆12Feb 18, 2021Updated 5 years ago
shibing624 / relext
View on GitHub
RelExt: A Tool for Relation Extraction from Text. 文本实体关系抽取工具。
☆49Jun 9, 2022Updated 4 years ago
stat-fit / westat
View on GitHub
a python package for stat,caculate woe,iv, ks,auc,roc,psi and plot.
☆17Jul 17, 2023Updated 2 years ago
itjieluo / EcommerceShoppingGuideRobot
View on GitHub
基于电商导购机器人，自然语言理解（NLU），文本纠错，歧义词消歧
☆12May 5, 2020Updated 6 years ago
boudinfl / pke
View on GitHub
Python Keyphrase Extraction module
☆1,590Jul 12, 2023Updated 2 years ago
shibing624 / textgen
View on GitHub
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型，实现了包括LLaMA，ChatGLM，BLO…
☆980Sep 14, 2024Updated last year