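pke_zh: Python keyphrase extraction for Chinese (zh). A tool for extracting keywords or key sentences from Chinese text; it implements KeyBert, PositionRank, TopicRank, TextRank, and other algorithms, and works out of the box.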
☆216 Mar 27, 2024 Updated last year
Alternatives and similar repositories for pke_zh
Users that are interested in pke_zh are comparing it to the libraries listed below
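- Unsupervised Chinese keyphrase extraction: statistics-based, graph-based [LDA and PageRank (TextRank, TPR, Salience Rank, Single TPR, etc.)], and embedding-based [SIFRank, etc.]; works out of the box! ☆109 Jun 20, 2022 Updated 3 years ago
- Chinese keyword extraction ☆14 Aug 7, 2023 Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition tool that supports BertSoftmax, BertSpan, and other models; works out of the box. ☆118 Feb 19, 2024 Updated 2 years ago
- pytextclassifier is a toolkit for text classification. It implements LR, Xgboost, TextCNN, FastText, TextRNN, BERT, and other classification models; works out of the box. ☆522 Sep 25, 2024 Updated last year
- Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of items; developed in Python 3; works out of the box. ☆899 Updated this week
- pinyintokenizer: a pinyin tokenizer that splits continuous pinyin into a list of single-character pinyin syllables. ☆31 Feb 5, 2025 Updated last year
- text2vec, text to vector. A text representation tool that converts text into vector matrices; it implements Word2Vec, RankBM25, Sentence-BERT, CoSENT, and other text representation and text similarity models; works out of the box. ☆4,945 Feb 14, 2026 Updated 3 weeks ago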
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆15 Nov 25, 2023 Updated 2 years ago
- Third-place solution and code for few-shot cross-class transfer event extraction in the financial domain ☆17 Dec 23, 2020 Updated 5 years ago
- unified embedding model ☆876 Sep 1, 2023 Updated 2 years ago
- A Chinese NLP Preprocessing & Parsing Package: accurate, efficient, and easy to use. www.jionlp.com ☆3,800 Nov 27, 2025 Updated 3 months ago
- An off-the-shelf tool for Chinese Keyphrase Extraction: quickly extracts key phrases from Chinese text and uses only 35M of memory. www.jionlp.com ☆555 Nov 21, 2023 Updated 2 years ago
- dialogbot provides search-based, task-based, and generative dialogue models. A dialogue bot built on question-answering, task-oriented, and chit-chat dialogue models; supports web-retrieval question answering, domain knowledge… ☆331 Apr 23, 2024 Updated last year
- An implementation of Attention-over-Attention Neural Networks for Reading Comprehension ☆20 Aug 15, 2018 Updated 7 years ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models. ☆24 Nov 6, 2024 Updated last year
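- Notes on what the author learned while studying recommender systems, covering crawlers, big data, NLP, recall, ranking, and other modules, in two parts: knowledge summaries and code practice. ☆12 Dec 8, 2020 Updated 5 years ago
- Based on an e-commerce shopping-guide bot: natural language understanding (NLU), text error correction, and disambiguation of ambiguous words ☆12 May 5, 2020 Updated 5 years ago
- TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet, and so on. Text generation models, with implementations including LLaMA, ChatGLM, BLO… ☆980 Sep 14, 2024 Updated last year
- Keyword extraction project ☆24 Sep 29, 2020 Updated 5 years ago
- OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! This may be the most comprehensive to date… ☆209 May 3, 2024 Updated last year
- Chinese Sentiment Classification Tool. Sentiment polarity classification based on the HowNet, Tsinghua, and BosonNLP sentiment lexicons; easy to extend; a baseline method; works out of the box. ☆104 Aug 22, 2023 Updated 2 years ago
- pycorrector is a toolkit for text error correction. It applies Kenlm, T5, MacBERT, ChatGLM3, Qwen2.5, and other models to error correction; works out of the box. ☆6,385 Jan 12, 2026 Updated last month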
- DCIC2023 Fraud Risk Identification Competition Solution. ☆26 Mar 30, 2023 Updated 2 years ago
- ☆10 Aug 20, 2018 Updated 7 years ago
- This repository contains the data used for the paper "Entity Recognition at First Sight: Improving NER with Eye Movement Information" by … ☆11 Jan 22, 2020 Updated 6 years ago
- Classical Chinese information extraction (entity recognition + relation extraction) ☆10 Feb 24, 2023 Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,106 May 9, 2024 Updated last year
- Agentica: Lightweight async-first Python framework for AI agents. Supports tool calling, RAG, multi-agent setups, and MCP. ☆246 Updated this week
- Multi-label text classification with fasttext ☆13 Sep 16, 2019 Updated 6 years ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset. ☆22 Feb 13, 2025 Updated last year
- Open-source code of the TOP2 entry in the first E-commerce AI Algorithm Competition ☆13 Aug 31, 2021 Updated 4 years ago
- Yet another coding assistant powered by LLM. ☆16 Sep 11, 2024 Updated last year
- ☆18 Mar 4, 2025 Updated last year
- Deep Keyphrase Generation with Pre-trained Language Models ☆29 Feb 23, 2024 Updated 2 years ago
- Minimal keyword extraction with BERT ☆4,121 Feb 3, 2026 Updated last month
- a bert for retrieval and generation ☆859 Feb 26, 2021 Updated 5 years ago
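- labelit: a labeling tool with active learning for classification tasks. Automatic labeling based on active learning: it learns while you label, reducing the amount of manual labeling. ☆31 Dec 9, 2022 Updated 3 years ago
- This project consists of three modules. Intent recognition: determines whether the user's intent is business-related or chit-chat. Model retrieval: builds a corpus and, when the user issues a new query (classified as a business dialogue by intent recognition), matches the best response retrieved for that query, using HSWN for recall (coarse ranking), then computing sentence similarity and using Lig… ☆12 Feb 18, 2021 Updated 5 years ago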
- A Triton-only attention backend for vLLM ☆24 Feb 11, 2026 Updated 3 weeks ago