SkydustZ / AEC-domain-corpora
The code and dataset for the paper "Pretrained Domain-Specific Language Model for General Information Retrieval Tasks in the AEC Domain"
☆14 Updated 2 years ago
Related projects
Alternatives and complementary repositories for AEC-domain-corpora
- Entity/relation/event extraction based on GlobalPointer ☆142 Updated 2 years ago
- ☆212 Updated last year
- LLM for NER ☆55 Updated 3 months ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short). Note: we set a… ☆170 Updated last year
- [IJCAI 2021] Document-level Relation Extraction as Semantic Segmentation ☆132 Updated last year
- TechGPT: Technology-Oriented Generative Pretrained Transformer ☆214 Updated last year
- Named entity recognition with Baidu UIE, based on PyTorch. ☆54 Updated last year
- Papers related to information extraction. ☆70 Updated last year
- Relation extraction ☆52 Updated last year
- 💡 GENIUS – generating text using sketches! A strong text generation & data augmentation tool. ☆174 Updated last year
- CoSENT, STS, SentenceBERT ☆162 Updated last year
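- Southeast University multimodal knowledge graph: OpenRichpedia project files ☆27 Updated 3 years ago
- Crawls Chinese Wikipedia using a scrapy-based hierarchical priority queue method and automatically extracts structured and semi-structured data ☆134 Updated last year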
- Knowledge Graph ☆169 Updated 2 years ago
- Minimal keyword extraction with BERT ☆75 Updated 3 years ago
- Upgraded version of RoFormer ☆149 Updated 2 years ago
- Chinese machine reading comprehension datasets ☆100 Updated 3 years ago
- Chinese NER models based on lexical information fusion ☆162 Updated 2 years ago
- We released BERT-wwm, a Chinese pre-training model based on Whole Word Masking technology, and models closely related to this technology.… ☆58 Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition toolkit supporting models such as BertSoftmax and BertSpan, ready to use out of the box. ☆113 Updated 9 months ago
- PDF parsing toolkit for preparing academic text corpus ☆49 Updated 4 months ago
- Chinese bigbird pretrained models ☆89 Updated 2 years ago
- Chinese-Text-Classification Project including bert-classification, textCNN and so on. ☆145 Updated 2 years ago
- ☆106 Updated 8 months ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings) ☆76 Updated last year
- Two approaches to NLP text augmentation: synonym replacement (using a word2vec vocabulary) and back-translation ☆71 Updated 3 years ago
- ☆32 Updated 2 years ago
- Multi-label text classification based on pytorch + bert ☆91 Updated last year
- ☆37 Updated 3 months ago