TW-NLP / ChineseErrorCorrector
Chinese spelling and grammatical error correction
☆109 Updated 2 weeks ago
Alternatives and similar repositories for ChineseErrorCorrector:
Users who are interested in ChineseErrorCorrector are comparing it to the libraries listed below
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute ☆276 Updated 7 months ago
- Open-source release of the MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr… ☆534 Updated last year
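- YAYI large model for information extraction: instruction-tuned on million-scale, manually constructed, high-quality information extraction data; developed by the Zhongke Wenge (中科闻歌) algorithm team. (Repo for YAYI Unified Information Extraction Model) ☆300 Updated 8 months ago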
- Cleaning and quality evaluation of Chinese corpora for large-model pre-training ☆57 Updated 8 months ago
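- This project trains and runs inference with layoutlmv3 on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format; 2. modifying the tokenization of layoutlmv3-base-chinese; 3. splitting texts longer than 512 and applying a sliding window ☆44 Updated 7 months ago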
- A simple and fast tool for word segmentation and named entity recognition ☆576 Updated 2 weeks ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings) ☆84 Updated last month
- "Taoli" (桃李): a large model for international Chinese education ☆177 Updated last year
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding ☆223 Updated last year
- Analysis of Chinese and English layouts ☆192 Updated 3 weeks ago
- ☆321 Updated 10 months ago
- PyTorch-based Chinese intent recognition and slot filling ☆170 Updated 9 months ago
- Explanations and code usage for NLP in deep learning ☆48 Updated last year
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co… ☆66 Updated last month
- BERT-based intent and slots detector for chatbots. ☆173 Updated last month
- A native Chinese benchmark for evaluating retrieval-augmented generation ☆115 Updated 11 months ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer ☆223 Updated last year
- text correction papers ☆303 Updated last year
- Examples of retrieval-augmented generation (RAG) based on large language models ☆142 Updated 4 months ago
- General information extraction (entity/relation/event extraction) based on the Qwen2 model ☆31 Updated 9 months ago
- ☆139 Updated 10 months ago
- Alpaca Chinese Dataset -- a Chinese instruction fine-tuning dataset ☆195 Updated 6 months ago
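- Chinese datasets for AI training (continuously updated) and a map of AI companies. Current datasets: 8,000 restaurant-industry questions, Baidu Zhidao, the Alpaca Chinese dataset, a computer-domain dataset, the Vicuna dataset, the RedPajama dataset, a Chinese Wikipedia entries dataset, and a website/forum Q&A dataset ☆56 Updated last year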
- A simple, easy-to-hack GraphRAG implementation ☆13 Updated 6 months ago
- ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune ☆107 Updated last year
- MiniRBT (a series of small Chinese pre-trained models) ☆274 Updated 2 years ago
- Analysis of the Chinese cognitive abilities of language models ☆236 Updated last year
- Document orientation classification ☆216 Updated 4 months ago
- Chinese text correction based on BERT ☆233 Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w… ☆38 Updated 3 weeks ago