SkydustZ / AEC-domain-corpora
The code and dataset for the paper "Pretrained Domain-Specific Language Model for General Information Retrieval Tasks in the AEC Domain"
☆21Updated 2 years ago
Alternatives and similar repositories for AEC-domain-corpora:
Users who are interested in AEC-domain-corpora are comparing it to the libraries listed below
- Automated rule transformation for automated rule checking☆32Updated 2 years ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short). Note: we set a…☆174Updated last year
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆225Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition toolkit supporting models such as BertSoftmax and BertSpan, usable out of the box.☆113Updated last year
- Examples about using MGeo finetune models☆43Updated 2 years ago
- LLM for NER☆70Updated 8 months ago
- Entity, relation, and event extraction based on GlobalPointer☆146Updated 3 years ago
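- YAYI information extraction large model: instruction-tuned on millions of manually constructed, high-quality information extraction examples, developed by the Zhongke Wenge (中科闻歌) algorithm team. (Repo for YAYI Unified Information Extraction Model)☆300Updated 8 months ago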
- A Chinese annotation tool supporting NER, text classification, relation annotation, dialogue annotation, and more.☆70Updated 8 months ago
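- Knowledge graph entity and relation extraction, knowledge graph construction, and a retrieval-augmented generation demo for building code text data☆22Updated 8 months ago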
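- A tutorial and implementation of a disease-centered medical knowledge graph and a QA system based on it. Knowledge graph construction, automatic question answering, KG-based question answering. A disease-centered medical-domain knowledge graph of a certain scale…☆69Updated 6 years ago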
- Chinese named entity recognition using a PyTorch implementation of GlobalPointer.☆37Updated last year
- An information extraction model based on BiLSTM + CRF☆33Updated 3 years ago
- Automatically build knowledge graphs with Python, at the scale of millions, tens of millions, and hundreds of millions☆39Updated last year
- Chinese document classification with LayoutLMv3 and LayoutXLM☆43Updated 2 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆261Updated 3 years ago
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute☆279Updated 7 months ago
- Integrating ONgDB database into langchain ecosystem☆77Updated last year
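- Crawls Chinese Wikipedia using a Scrapy-based hierarchical priority queue method and automatically extracts structured and semi-structured data☆149Updated 2 years ago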
- Information extraction using pointer networks, including named entity recognition, relation extraction, and event extraction.☆123Updated 2 years ago
- Knowledge graph construction for government official documents☆21Updated 2 years ago
- Relation extraction☆57Updated last year
- KgCLUE: large-scale open-source Chinese knowledge graph question answering☆446Updated 2 years ago
- ☆26Updated this week
- Chinese NER models based on lexical information fusion☆166Updated 3 years ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆191Updated 3 months ago
- Chinese layout detection: YOLOv8 is used to detect the layout of Chinese document images.☆59Updated last year
- An open-source NLP auto-annotation tool for the Chinese-speaking world; leave the simple samples to LabelFast.☆70Updated 3 months ago
- Word sense disambiguation (WSD) implemented with an extractive NLP model (machine reading comprehension, MRC)☆12Updated 2 years ago
- PyTorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆33Updated last year