UnstoppableCurry / High-quality-Chinese-Q-A-dataset
The largest open-source Chinese Q&A dataset, supporting Chinese LLMs.
☆9 Updated last year
Alternatives and similar repositories for High-quality-Chinese-Q-A-dataset:
Users who are interested in High-quality-Chinese-Q-A-dataset are comparing it to the repositories listed below
- deep training task ☆29 Updated last year
- Named entity recognition with Baidu UIE, implemented in PyTorch. ☆57 Updated 2 years ago
- Using LEAR for NER extraction ☆29 Updated 2 years ago
- Medical named entity recognition based on PaddleNLP's open-source extractive UIE (torch implementation) ☆44 Updated 2 years ago
- Hands-on information extraction with LLaMA ☆98 Updated last year
- [Unofficial] Prediction code for the AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification ☆52 Updated 2 years ago
- Chinese sentence embedding generation based on SimCSE ☆15 Updated 2 years ago
- Chinese named entity recognition with GlobalPointer, implemented in PyTorch. ☆36 Updated last year
- ChatGLM2-6B fine-tuning: SFT/LoRA, instruction fine-tuning ☆105 Updated last year
- LLM for NER ☆62 Updated 6 months ago
- ☆57 Updated 2 years ago
- Viscacha: a collection of general-purpose information extraction datasets ☆27 Updated last year
- Supports LoRA fine-tuning of ChatGLM2 ☆39 Updated last year
- CCKS financial event subject extraction ☆72 Updated 4 years ago
- SinglepassTextCluster, a text clustering tool based on the Single-Pass clustering algorithm that uses TF-IDF vectors and doc2vec, which can be used for… ☆62 Updated 3 years ago
- BERT implementation in PyTorch with fine-tuning on downstream tasks ☆48 Updated 2 years ago
- pytorch Efficient GlobalPointer ☆53 Updated 2 years ago
- https://tianchi.aliyun.com/dataset/dataDetail?dataId=95414 ☆30 Updated 3 years ago
- Sentence matching models, including the unsupervised SimCSE, ESimCSE, and PromptBERT, and the supervised SBERT and CoSENT. ☆97 Updated 2 years ago
- A simple framework for building some basic NLP tasks ☆59 Updated 2 years ago
- ☆13 Updated 3 years ago
- Chinese annotation tool supporting NER, text classification, relation annotation, dialogue annotation, and more. ☆66 Updated 6 months ago
- ☆23 Updated last year
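- Simple and efficient multi-GPU fine-tuning of large models with deepspeed + trainer ☆122 Updated last year
- Pre-trains a BERT model with the masked LM pre-training task. Trains model representations on vertical-domain corpora to improve downstream task performance. ☆41 Updated last year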
- benchmark of KgCLUE, with different models and methods ☆27 Updated 3 years ago
- RelExt: A Tool for Relation Extraction from Text. ☆48 Updated 2 years ago
- Chinese NER models based on lexical information fusion ☆164 Updated 2 years ago
- GPLinker_pytorch ☆80 Updated 2 years ago
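- CCKS2020 entity linking task for Chinese short text. The main idea: use a semantic model based on BiLSTM and Attention to match the Query text against the Doc text, then apply pairwise ranking to the matching scores to select the best knowledge-base entity. ☆47 Updated 3 years ago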