SkydustZ / AEC-domain-corporaLinks
The code and dataset for the paper "Pretrained Domain-Specific Language Model for General Information Retrieval Tasks in the AEC Domain"
☆22Updated 2 years ago
Alternatives and similar repositories for AEC-domain-corpora
Users that are interested in AEC-domain-corpora are comparing it to the libraries listed below
Sorting:
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆308Updated last year
- VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)☆193Updated 2 years ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆226Updated 2 years ago
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆159Updated last year
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆57Updated 11 months ago
- Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆42Updated 2 years ago
- Baichuan-13B 指令微调☆91Updated 2 years ago
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding☆226Updated last year
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆173Updated 2 years ago
- kbqa,langchain,large langauge model, chatgpt☆81Updated 10 months ago
- 针对建筑规范文本数据的知识图谱实体关系提取,知识图谱构建,检索增强生成DEMO☆31Updated last year
- 基于GlobalPointer的实体/关系/事件抽取☆148Updated 3 years ago
- PaddleNLP UIE模型的PyTorch版实现☆645Updated 2 years ago
- 多模态中文LLaMA&Alpaca大语言模型(VisualCLA)☆451Updated 2 years ago
- Examples about using MGeo finetune models☆48Updated 2 years ago
- KgCLUE: 大规模中文开源知识图谱问答☆448Updated 3 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- ☆257Updated 2 years ago
- AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models☆447Updated last year
- Universal information extraction with instruction learning☆391Updated 6 months ago
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆253Updated 2 years ago
- llama信息抽取实战☆100Updated 2 years ago
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope…☆824Updated last year
- 基于scrapy的层次优先队列方法爬取中文维基百科,并自动抽取结构和半结构数据☆156Updated 2 years ago
- We released BERT-wwm, a Chinese pre-training model based on Whole Word Masking technology, and models closely related to this technology.…☆62Updated 2 years ago
- 📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解☆50Updated 2 years ago
- Unified Structure Generation for Universal Information Extraction☆938Updated 3 years ago
- 中文CLIP预训练模型☆417Updated 2 years ago
- 基于pytorch_bert的中文多标签分类☆91Updated 3 years ago
- 本项目使用大语言模型(LLM)进行开放领域三元组抽取。☆29Updated last year