wjn1996 / scrapy_for_zh_wikiLinks
基于scrapy的层次优先队列方法爬取中文维基百科,并自动抽取结构和半结构数据
☆156Updated 2 years ago
Alternatives and similar repositories for scrapy_for_zh_wiki
Users that are interested in scrapy_for_zh_wiki are comparing it to the libraries listed below
Sorting:
- KgCLUE: 大规模中文开源知识图谱问答☆450Updated 3 years ago
- SimCSE中文语义相似度对比学习模型☆88Updated 3 years ago
- A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .☆210Updated 3 years ago
- All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借鉴于其他开源项目,原先是自己玩的,后来干脆也开源出来)☆289Updated this week
- ☆39Updated 2 years ago
- 基于BERT的中文命名实体识别☆44Updated 3 years ago
- Implemention of NER model on chinese dataset.☆73Updated 2 years ago
- 关系抽取☆58Updated 2 years ago
- 中文命名实体识别☆47Updated 3 years ago
- 北京航空航天大学大数据高精尖中心自然语言处理研究团队对信息抽取领域的调研。包括实体识别,关系抽取,属性抽取等子任务,每类子任务分别对学术界和工业界进行调研。☆472Updated 3 years ago
- 基于pytorch的GlobalPointer进行三元组抽取。☆83Updated 2 years ago
- Chinese-Text-Classification Project including bert-classification, textCNN and so on.☆161Updated 3 years ago
- This is updated version of the dataset for Chinese community medical question answering.☆357Updated 6 years ago
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆99Updated 2 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆132Updated 3 years ago
- 中文NER的那些事儿☆320Updated last year
- CMeIE/CBLUE/CHIP/实体关系抽取/SPO抽取☆237Updated 3 years ago
- 基于pytorch + bert的多标签文本分类(multi label text classification)☆108Updated 2 years ago
- Using BERT+Bi-LSTM+CRF☆140Updated 3 years ago
- A PyTorch implementation of a BiLSTM\BERT\Roberta(+CRF) model for Named Entity Recognition.☆507Updated 4 years ago
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集☆641Updated 2 years ago
- 根据维基中文语料库预训练 GloVe 中文词向量;Pre-train GloVe word-embedding From Chinese Wiki corpus☆77Updated last year
- A tutorial and implement of disease centered Medical knowledge graph and qa system based on it。知识图谱构建,自动问答,基于kg的自动问答。以疾病为中心的一定规模医药领域知识图谱…☆70Updated 6 years ago
- 基于pytorch+bert的中文文本分类☆88Updated 2 years ago
- Unified Structure Generation for Universal Information Extraction☆939Updated 3 years ago
- OneRel在中文关系抽取中的使用☆129Updated last year
- ☆107Updated last year
- OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全…☆208Updated last year
- 该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【信息抽取篇】☆29Updated 2 years ago
- bert_seq2seq的DDP版本,支持bert、roberta、nezha、t5、gpt2等模型,支持seq2seq、ner、关系抽取等任务,无需添加额外代码,轻松启动DDP多卡训练。☆53Updated 3 years ago