Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
☆432May 17, 2020Updated 5 years ago
Alternatives and similar repositories for SIFRank_zh
Users that are interested in SIFRank_zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"☆121Mar 20, 2021Updated 5 years ago
- An off-the-shelf tool for Chinese Keyphrase Extraction 一个快速从中文里抽取关键短语的工具,仅占35M内存 www.jionlp.com☆554Nov 21, 2023Updated 2 years ago
- ☆278Dec 8, 2020Updated 5 years ago
- 基于方差权重因子选词的SIF句向量模型-实验源码☆11Mar 8, 2020Updated 6 years ago
- Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀".目前,本项目实现了一个文本复述任务,用于NLP语料的数据增强。☆215Mar 24, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 神策杯2018高校算法大师赛(中文关键词提取)第二名代码方案☆308May 6, 2020Updated 5 years ago
- 天池 疫情相似句对判定大赛 线上第一名方案☆435Oct 17, 2020Updated 5 years ago
- ☆448Oct 26, 2022Updated 3 years ago
- Modify Chinese text, modified on LaserTagger Model. 文本复述,基于lasertagger做中文文本数据增强。☆322Jan 3, 2024Updated 2 years ago
- DeepIE: Deep Learning for Information Extraction☆1,943Dec 9, 2022Updated 3 years ago
- 速度更快、效果更好的中文新词发现☆513Mar 15, 2024Updated 2 years ago
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- Python Keyphrase Extraction module☆1,591Jul 12, 2023Updated 2 years ago
- 中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com☆3,804Nov 27, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要,文本相似度,科学计算器,中文数字阿拉伯数字(罗马数字)转换,中文繁简转换,拼音转换。tookit(tool) of N…☆662Mar 24, 2023Updated 3 years ago
- 无监督中文关键词抽取(Keyphrase Extraction),基于统计,基于图【LDA与PageRank(TextRank, TPR, Salience Rank, Single TPR等)】,基于嵌入【SIFRank等】,开箱即用!☆109Jun 20, 2022Updated 3 years ago
- 文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法☆2,603May 13, 2024Updated last year
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,179Jul 15, 2025Updated 8 months ago
- CCKS 2019 中文短文本实体链指比赛技术创新奖解决方案☆411Mar 24, 2023Updated 3 years ago
- ChineseSemanticKB,chinese semantic knowledge base, 面向中文处理的12类、百万规模的语义常用词典,包括34万抽象语义库、34万反义语义库、43万同义语义库等,可支持句子扩展、转写、事件抽象与泛化等多种应用场景。☆779Mar 17, 2023Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year
- Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)☆2,237Mar 11, 2023Updated 3 years ago
- An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。☆1,385May 31, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!☆2,798Mar 15, 2026Updated last week
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,878Mar 18, 2025Updated last year
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆816Jul 8, 2020Updated 5 years ago
- 法研杯2019相似案例匹配第二名解决方案(附数据集和文档),CAIL2020/2021司法考试赛道冠军队伍☆251Jun 4, 2021Updated 4 years ago
- Automated Phrase Mining from Massive Text Corpora in Python.☆174May 23, 2021Updated 4 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,983Nov 21, 2022Updated 3 years ago
- 基于bert进行中文文本纠错☆241Jun 12, 2023Updated 2 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- a bert for retrieval and generation☆859Feb 26, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 关键词抽取,神策杯2018高校算法大师赛比赛,solo 排名3/591☆65Oct 23, 2018Updated 7 years ago
- Python实现中文文本关键词抽取,分别用了TF-IDF、LDA、RNN、LSTM和LR-SGD两类共五种方法,全网最全没有之一。☆33Jan 22, 2021Updated 5 years ago
- Deep Keyphrase Extraction using BERT☆260Feb 21, 2022Updated 4 years ago
- NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.☆286Dec 8, 2022Updated 3 years ago
- 搜索所有中文NLP数据集,附常用英文NLP数据集☆4,422Nov 21, 2022Updated 3 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,440Jul 15, 2025Updated 8 months ago
- Entity and Relation Extraction Based on TensorFlow and BERT. 基于TensorFlow和BERT的管道式实体及关系抽取,2019语言与智能技术竞赛信息抽取任务解决方案。Schema based Knowledge …☆1,230Jun 1, 2020Updated 5 years ago