hccngu / ViscachaLinks
Viscacha:通用信息抽取数据集收集
☆26Updated last year
Alternatives and similar repositories for Viscacha
Users that are interested in Viscacha are comparing it to the libraries listed below
Sorting:
- Universal information extraction with instruction learning☆388Updated 4 months ago
- AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models☆444Updated last year
- 全局指针统一处理嵌套与非嵌套NER的Pytorch实现☆393Updated 2 years ago
- LLM for NER☆75Updated 11 months ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆174Updated 2 years ago
- Implemention of NER model on chinese dataset.☆73Updated 2 years ago
- experiments of some semantic matching models and comparison of experimental results.☆162Updated 2 years ago
- 基于词汇信息融合的中文NER模型☆169Updated 3 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆119Updated last year
- https://tianchi.aliyun.com/dataset/dataDetail?dataId=95414☆30Updated 3 years ago
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding☆225Updated last year
- 中文数据集下SimCSE+ESimCSE的实现☆192Updated 3 years ago
- 基于pytorch的百度UIE命名实体识别。☆56Updated 2 years ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆306Updated 11 months ago
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆95Updated 2 years ago
- llama信息抽取实战☆100Updated 2 years ago
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆299Updated 2 years ago
- GPLinker_pytorch☆82Updated 3 years ago
- 中文自然语言推理与语义相似度数据集☆359Updated 3 years ago
- Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification☆532Updated 3 years ago
- 本人项目进行中搜集的数据集,包含原始数据和经过处理后的数据,项目持续更新。☆115Updated 4 years ago
- text correction papers☆305Updated last year
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆372Updated last year
- ☆417Updated last year
- text embedding☆146Updated last year
- A simple framework for building some basic NLP tasks☆59Updated 2 years ago
- [SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval☆188Updated 2 years ago
- basic framework for rag(retrieval augment generation)☆85Updated last year
- ☆106Updated last year
- A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction☆27Updated 3 years ago