SkydustZ / AEC-domain-corporaLinks
The code and dataset for the paper "Pretrained Domain-Specific Language Model for General Information Retrieval Tasks in the AEC Domain"
☆24Updated 3 years ago
Alternatives and similar repositories for AEC-domain-corpora
Users that are interested in AEC-domain-corpora are comparing it to the libraries listed below
Sorting:
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆163Updated 2 years ago
- VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)☆194Updated 2 years ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆314Updated last year
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆81Updated last week
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆228Updated 2 years ago
- AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models☆449Updated 2 years ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆62Updated last year
- Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆46Updated 2 years ago
- 基于scrapy的层次优先队列方法爬取中文维基百科,并自动抽取结构和半结构数据☆157Updated 2 years ago
- 中文CLIP预训练模型☆419Updated 3 years ago
- QAonMilitaryKG,QaSystem based on military knowledge graph that stores in mongodb which is different from the previous one, 基于mongodb存储的军事…☆104Updated 6 years ago
- LLM for NER☆80Updated last year
- Minimal keyword extraction with BERT☆88Updated 4 years ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆120Updated 2 years ago
- PaddleNLP UIE模型的PyTorch版实现☆666Updated 2 years ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆176Updated 2 years ago
- KgCLUE: 大规模中文开源知识图谱问答☆453Updated 3 years ago
- chatglm-6B for tools application using langchain☆76Updated 2 years ago
- 针对建筑规范文本数据的知识图谱实体关系提取,知识图谱构建,检索增强生成DEMO☆33Updated last year
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope…☆830Updated last year
- CDLA: A Chinese document layout analysis (CDLA) dataset☆287Updated 4 years ago
- 多模态中文LLaMA&Alpaca大语言模型(VisualCLA)☆458Updated 2 years ago
- Examples about using MGeo finetune models☆51Updated 2 years ago
- 使用python自动构建知识图谱,百万、千万、亿万级别☆45Updated 2 years ago
- ☆47Updated last year
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆43Updated 11 months ago
- ☆133Updated 2 years ago
- 学习开源chatGPT类模型的指南,汇总各种训练数据获取、模型微调、模型服务的方法,以及记录自己操作总遇到的各种常见坑,欢迎收藏、转发,希望能帮你省一些时间☆75Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆169Updated 3 years ago
- 文档方向分类☆225Updated last year