blcuicall / OMGEvalLinks
OMGEval๐ฎ: An Open Multilingual Generative Evaluation Benchmark for Foundation Models
โ35Updated last year
Alternatives and similar repositories for OMGEval
Users that are interested in OMGEval are comparing it to the libraries listed below
Sorting:
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Explorationโ36Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)โ49Updated last year
- ๐ฉบ A collection of ChatGPT evaluation reports on various bechmarks.โ50Updated 2 years ago
- โ56Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Modelsโ114Updated 4 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"โ135Updated last year
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)โ17Updated last year
- an easy-to-use knn-mt toolkitโ104Updated 2 years ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"โ20Updated 10 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimationโ89Updated 10 months ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseqโ29Updated 2 years ago
- Collection of papers for scalable automated alignment.โ93Updated 11 months ago
- code for Teaching LM to Translate with Comparisonโ39Updated last year
- โ29Updated 2 years ago
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Templateโ22Updated 2 years ago
- โ145Updated last year
- Source code for paper "A Two-Stage Method for Chinese AMR Parsing" @ CAMRP-2022 & CCL-2022โ24Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Budโ21Updated last year
- โ84Updated 9 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedbackโ40Updated 2 years ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)โ95Updated 7 months ago
- ๐ผ Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Expertsโ40Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsโ56Updated last year
- โ23Updated 2 years ago
- โ96Updated last year
- The official code of the 2023 ACL paper "Enhancing Grammatical Error Correction Systems with Explanations"โ28Updated 2 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractorsโ38Updated 8 months ago
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPTโ89Updated 3 weeks ago
- ๐ An unofficial implementation of Self-Alignment with Instruction Backtranslation.โ139Updated 5 months ago
- โ33Updated last year