Lafitte1573 / NLCorporaLinks
收集 NLP 领域的高质量中文数据集
☆35Updated 3 months ago
Alternatives and similar repositories for NLCorpora
Users that are interested in NLCorpora are comparing it to the libraries listed below
Sorting:
- ☆113Updated last year
- ☆79Updated 11 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆42Updated last year
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆279Updated 2 years ago
- text correction papers☆307Updated last year
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆107Updated 4 years ago
- Archive for AINLP History Article☆189Updated 3 years ago
- SimCSE有监督与无监督实验复现☆149Updated last year
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆88Updated 4 years ago
- 阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理☆113Updated last year
- Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension☆29Updated last year
- 基于T5模型的中文文本纠错☆32Updated 9 months ago
- 使用 Qwen2ForSequenceClassification 简单实现文本分类任务。☆75Updated last year
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- Source code for the paper "C-LLM: Learn to Check Chinese Spelling Errors Character by Character"☆25Updated 8 months ago
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆59Updated last year
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆96Updated 2 years ago
- 大模型文本分类☆73Updated 11 months ago
- Baichuan-13B 指令微调☆91Updated 2 years ago
- 用于汇总目前的开源中文对话数据集☆169Updated 2 years ago
- 中文文本纠错相关的论文、比赛和工具。☆61Updated 3 weeks ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆92Updated 5 months ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆553Updated 2 years ago
- 该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记(多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT)☆348Updated last year
- RAG 论文学习☆156Updated 4 months ago
- Rephrasing Language Model for CSC (AAAI 2024)☆41Updated last year
- LLM for NER☆78Updated last year
- 此项目完成了关于 NLP-Beginner:自然语言处理入门练习 的所有任务(文本分类、信息抽取、知识图谱、机器翻译、问答系统、文本生成、Text-to-SQL、文本纠错、文本挖掘、知识蒸馏、模型加速、OCR、TTS、Prompt、embedding等),所有代码都经过测试…☆208Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆81Updated last year
- 开源SFT数据集整理,随时补充☆533Updated 2 years ago