blcuicall / OMGEval
OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models
☆35, updated last year
Alternatives and similar repositories for OMGEval
Users who are interested in OMGEval are comparing it to the repositories listed below.
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36, updated last year
- ☆28, updated 3 years ago
- ☆89, updated 11 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models ☆117, updated 6 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆50, updated last year
- Official Implementation of "Probing Language Models for Pre-training Data Detection" ☆20, updated last year
- An easy-to-use kNN-MT toolkit ☆106, updated 2 years ago
- A retrieval-augmented sequence modeling toolkit implemented based on Fairseq ☆29, updated 2 years ago
- Collection of papers for scalable automated alignment. ☆94, updated last year
- Code for "Teaching LM to Translate with Comparison" ☆39, updated 2 years ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023). ☆14, updated 2 years ago
- ACL 2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template ☆22, updated 2 years ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆90, updated last year
- ☆57, updated last year
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT ☆91, updated 2 months ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆22, updated last year
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks. ☆50, updated 2 years ago
- [ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction? ☆10, updated 2 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023) ☆17, updated last year
- ☆23, updated 3 years ago
- The official repository for our EMNLP 2024 paper, Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretab… ☆21, updated 9 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆58, updated last year
- An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆138, updated 7 months ago
- ☆146, updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆136, updated last year
- Repository for "Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning" ☆168, updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback ☆40, updated 2 years ago
- The repository for the paper "Evaluating Open-QA Evaluation" ☆25, updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆184, updated 5 months ago
- The official code of the 2023 ACL paper "Enhancing Grammatical Error Correction Systems with Explanations" ☆28, updated 2 years ago