SkydustZ / AEC-domain-corporaLinks
The code and dataset for the paper "Pretrained Domain-Specific Language Model for General Information Retrieval Tasks in the AEC Domain"
☆22Updated 2 years ago
Alternatives and similar repositories for AEC-domain-corpora
Users that are interested in AEC-domain-corpora are comparing it to the libraries listed below
Sorting:
- 使用python自动构建知识图谱,百万、千万、亿万级别☆43Updated 2 years ago
- Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆40Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated last year
- LLM for NER☆73Updated 11 months ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆304Updated 10 months ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具, 支持BertSoftmax、BertSpan等模型,开箱即用。☆114Updated last year
- 针对建筑规范文本数据的知识图谱实体关系提取,知识图谱构建,检索增强生成DEMO☆25Updated 10 months ago
- llama信息抽取实战☆100Updated 2 years ago
- 本项目使用大语言模型(LLM)进行开放领域三元组抽取。☆26Updated last year
- Integrating ONgDB database into langchain ecosystem☆77Updated 2 years ago
- 中文原生检索增强生成测评基准☆119Updated last year
- [ACL 2024] Official resources of "ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Larg…☆310Updated 10 months ago
- ☆249Updated 2 years ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆290Updated 9 months ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆226Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆89Updated 4 months ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆175Updated 2 years ago
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆65Updated 11 months ago
- Universal information extraction with instruction learning☆388Updated 4 months ago
- 政务公文知识图谱构建☆21Updated 2 years ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆48Updated 9 months ago
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆73Updated 5 months ago
- ☆36Updated 2 months ago
- ☆28Updated this week
- graphrag的基础架构☆34Updated 8 months ago
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆39Updated 8 months ago
- 东南大学多模态知识图谱-OpenRichpedia工程文件☆30Updated 3 years ago
- ☆112Updated 11 months ago
- [IJCAI 2021] Document-level Relation Extraction as Semantic Segmentation☆146Updated 2 years ago
- TianGong-AI-Unstructure☆68Updated 2 weeks ago