THUKElab / Visual-C3
Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters
☆16Updated 5 months ago
Related projects
Alternatives and complementary repositories for Visual-C3
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated 11 months ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆16Updated 9 months ago
- Implementation of latent-GLAT (ACL-2022)☆32Updated 2 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆29Updated 9 months ago
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆48Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆38Updated last year
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆79Updated 7 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆35Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆66Updated 5 months ago
- ☆53Updated 2 years ago
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆20Updated last year
- ☆59Updated last year
- A toolkit for evaluation of natural language generation (NLG), including BLEU, ROUGE, METEOR, and CIDEr.☆31Updated 4 years ago
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆29Updated last year
- ☆22Updated 2 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆34Updated 5 months ago
- A Chinese Spell Checking Model Released on EMNLP2022.☆20Updated last year
- Towards Systematic Measurement for Long Text Quality☆28Updated 2 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆53Updated 3 months ago
- Code and data of the paper "MCTS: A Multi-Reference Chinese Text Simplification Dataset".☆28Updated 5 months ago
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆10Updated 11 months ago
- NTK scaled version of ALiBi position encoding in Transformer.☆66Updated last year
- Hierarchical Context Tagger for utterance rewriting☆13Updated 2 years ago
- This is the code repo for the paper "UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction"☆14Updated last year
- Example code and baseline implementation for Arena Contest 3 (擂台赛3), a large-scale pre-training and fine-tuning competition☆38Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆107Updated 3 months ago
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models☆32Updated 3 months ago
- The official implementation of the ACL 2022 paper "Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks"☆33Updated last year
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆11Updated last year
- Rephrasing Language Model for CSC (AAAI 2024)☆35Updated 5 months ago