This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Text Correction.
☆15Nov 25, 2023Updated 2 years ago
Alternatives and similar repositories for CCL2023-CLTC-THU_KELab
Users that are interested in CCL2023-CLTC-THU_KELab are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- SIGIR 2022: Contrastive Learning with Hard Negative Entities for Entity Set Expansion☆30Jan 6, 2023Updated 3 years ago
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated 2 years ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)☆12May 17, 2025Updated last year
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆15Apr 26, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jun 11, 2024Updated last year
- 中山大学知识工程实验室介绍。☆40Aug 24, 2025Updated 9 months ago
- A Chinese Spell Checking Model Released on EMNLP2022.☆22Apr 14, 2023Updated 3 years ago
- 基于中心度的中文关键短语抽取工具☆11Sep 2, 2022Updated 3 years ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆47Aug 7, 2025Updated 9 months ago
- The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"☆20Nov 17, 2023Updated 2 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆128Jan 30, 2026Updated 4 months ago
- ☆12Aug 31, 2022Updated 3 years ago
- Source code for the paper "Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granular…☆45Jun 15, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 5 years ago
- Exploration of semantic chunking and chunk classification☆19Sep 16, 2024Updated last year
- 基于seq2edit (Gector) 的中文文本纠错。☆29Nov 15, 2022Updated 3 years ago
- ☆26Oct 9, 2024Updated last year
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- INOFFICIAL nfdump with libnfread: library for reading netflow records from nfdump files☆13Jan 28, 2014Updated 12 years ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆83Aug 18, 2024Updated last year
- ☆271Jul 26, 2024Updated last year
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆23Jan 16, 2025Updated last year
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆97Feb 18, 2025Updated last year
- [ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation…☆72Updated this week
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆568Jun 9, 2023Updated 2 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55May 17, 2023Updated 3 years ago
- text correction papers☆315Jan 23, 2024Updated 2 years ago
- 基于pytorch的TPLinker_plus进行中文命名实体识别☆19May 14, 2023Updated 3 years ago
- ☆19Mar 14, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Corpus-based Set Expansion with Lexical Features and Distributed Representations (SIGIR '19)☆13Jul 18, 2019Updated 6 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- Use the famous language model, xlnet, to do sequence tagging/ sequence labelling/ named entity recognition(NER) / noun extraction;☆18Sep 30, 2019Updated 6 years ago
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- Pytorch Implementation of "Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation"☆26Nov 14, 2022Updated 3 years ago
- [NAACL 2025 Findings] Code for "Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios"☆28Mar 5, 2025Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago