Jacob-Zhou / simple-csc
This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models".
☆67Updated last month
Alternatives and similar repositories for simple-csc:
Users that are interested in simple-csc are comparing it to the libraries listed below
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆84Updated 2 months ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆114Updated 4 months ago
- Rephrasing Language Model for CSC (AAAI 2024)☆41Updated 11 months ago
- 文本去重☆70Updated 10 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 大语言模型训练和服务调研☆37Updated last year
- ☆38Updated last year
- Imitate OpenAI with Local Models☆88Updated 7 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆79Updated 9 months ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆82Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- ☆97Updated last year
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆115Updated 2 months ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆68Updated 8 months ago
- ☆25Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆107Updated last year
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆33Updated 8 months ago
- ☆46Updated 10 months ago
- code for piccolo embedding model from SenseTime☆124Updated 11 months ago
- 专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。☆118Updated last month
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆41Updated last year
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆61Updated 6 months ago
- 🌳CED: Catalog Extraction from Documents☆16Updated last year
- 介绍docker、docker compose的使用。☆20Updated 7 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆65Updated 2 years ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆191Updated 3 months ago
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆57Updated last year
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆18Updated 5 months ago