Lafitte1573 / NLCorporaLinks
收集 NLP 领域的高质量中文数据集
☆36Updated 4 months ago
Alternatives and similar repositories for NLCorpora
Users that are interested in NLCorpora are comparing it to the libraries listed below
Sorting:
- ☆114Updated last year
- ☆80Updated last year
- 使用 Qwen2ForSequenceClassification 简单实现文本分类任务。☆82Updated last year
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆284Updated 2 years ago
- 大模型文本分类☆79Updated last year
- text correction papers☆310Updated last year
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆98Updated 2 years ago
- Archive for AINLP History Article☆189Updated 3 years ago
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆88Updated 4 years ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆82Updated last year
- 阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理☆114Updated last year
- Python ROUGE Score Implementation for Chinese Language Task (official rouge score)☆107Updated last year
- 此项目完成了关于 NLP-Beginner:自然语言处理入门练习 的所有任务(文本分类、信息抽取、知识图谱、机器翻译、问答系统、文本生成、Text-to-SQL、文本纠错、文本挖掘、知识蒸馏、模型加速、OCR、TTS、Prompt、embedding等),所有 代码都经过测试…☆210Updated last year
- 该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记(多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT)☆351Updated last year
- 活字通用大模型☆393Updated 11 months ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆553Updated 2 years ago
- SimCSE有监督与无监督实验复现☆149Updated last year
- llama,chatglm 等模型的微调☆90Updated last year
- 基于T5模型的中文文本纠错☆32Updated 10 months ago
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆377Updated last year
- LLM for NER☆79Updated last year
- Source code for the paper "C-LLM: Learn to Check Chinese Spelling Errors Character by Character"☆26Updated 9 months ago
- Baichuan-13B 指令微调☆91Updated 2 years ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- RAG 论文学习☆167Updated 5 months ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆92Updated 6 months ago
- 一个中文心理健康支持问答数据集,提供了丰富的援助策略标注。可用于生成富有援助策略的长咨询文本。☆225Updated last year
- The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"☆18Updated last year
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Updated last year
- 基于DPO算法微调语言大模型,简单好上手。☆43Updated last year