雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)
☆316Aug 8, 2024Updated last year
Alternatives and similar repositories for YAYI-UIE
Users that are interested in YAYI-UIE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Universal information extraction with instruction learning☆394Feb 28, 2025Updated last year
- An Open-sourced Knowledgable Large Language Model Framework.☆1,376Jan 11, 2025Updated last year
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆212Jan 9, 2025Updated last year
- Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)☆1,057Nov 18, 2024Updated last year
- A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)☆10May 8, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆55Nov 4, 2024Updated last year
- Unified Structure Generation for Universal Information Extraction☆956Jul 30, 2022Updated 3 years ago
- PaddleNLP UIE模型的PyTorch版实现☆686Aug 13, 2023Updated 2 years ago
- Viscacha:通用信息抽取数据集收集☆27Feb 21, 2024Updated 2 years ago
- [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction☆4,356Jul 19, 2025Updated 8 months ago
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope…☆828May 28, 2024Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆643Apr 22, 2024Updated last year
- This repository contains code and data for the paper "TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured T…☆28Jun 12, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,413Sep 29, 2023Updated 2 years ago
- Forked from *OneIE: A Joint Neural Model for Information Extraction with Global Features*☆21Sep 4, 2022Updated 3 years ago
- ☆98Mar 20, 2024Updated 2 years ago
- 中文命名实体识别。包含目前最新的中文命名实体识别论文、中文实体识别相关工具、数据集,以及中文预训练模型、词向量、实体识别综述等。☆763Jul 4, 2025Updated 8 months ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆409Dec 23, 2024Updated last year
- Code and data of WSDM 2023 paper "Hansel: A Chinese Few-Shot and Zero-Shot Entity Linking Benchmark".☆23Jun 1, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,652Oct 24, 2024Updated last year
- Retrieval and Retrieval-augmented LLMs☆11,443Mar 10, 2026Updated 2 weeks ago
- chinese document classification of layoutlmv3 and layoutxlm☆46Oct 25, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repository for training LLaMa 2 models using the NERRE format.☆65Dec 22, 2023Updated 2 years ago
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆139Mar 1, 2024Updated 2 years ago
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆42Jul 10, 2024Updated last year
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆391Jan 23, 2024Updated 2 years ago
- Guideline following Large Language Model for Information Extraction☆433Oct 27, 2024Updated last year
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac…☆134Jan 17, 2024Updated 2 years ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆159Jul 25, 2025Updated 8 months ago
- SFT+RL boosts multimodal reasoning☆47Jun 27, 2025Updated 9 months ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆177May 24, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated 11 months ago
- ☆979Feb 7, 2025Updated last year
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,782Dec 12, 2023Updated 2 years ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated last year
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,198Jun 19, 2024Updated last year
- [Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering☆192Jun 10, 2024Updated last year
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆86Jul 9, 2025Updated 8 months ago