blcuicall / OMGEval
OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models
☆32Updated 3 months ago
Related projects
Alternatives and complementary repositories for OMGEval
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆16Updated 9 months ago
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆48Updated last year
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆28Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆38Updated last year
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆16Updated 5 months ago
- Code for Teaching LM to Translate with Comparison☆37Updated 10 months ago
- Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"☆86Updated last week
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆25Updated 11 months ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆81Updated last year
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆10Updated this week
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆32Updated 8 months ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆19Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆34Updated 5 months ago
- Towards Systematic Measurement for Long Text Quality☆28Updated 2 months ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆46Updated 3 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆44Updated 6 months ago
- A Code System for Grammar Error Correction Method. Code Repo for ACL 24 Main "Detection-Correction Structure via General Language Model f…☆12Updated last month
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆20Updated last year
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆79Updated 7 months ago
- Codes and data for ACL 2023 Findings paper "Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning"☆15Updated 8 months ago
- CDQA: Chinese Dynamic Question Answering Benchmark☆14Updated 8 months ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆22Updated 3 months ago