blcuicall / OMGEval
OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models
☆35 · Updated last year
Alternatives and similar repositories for OMGEval
Users who are interested in OMGEval are comparing it to the repositories listed below.
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆49 · Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36 · Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models ☆110 · Updated 2 months ago
- ☆29 · Updated 2 years ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection" ☆19 · Updated 8 months ago
- Collection of papers for scalable automated alignment. ☆93 · Updated 10 months ago
- ☆56 · Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks. ☆50 · Updated 2 years ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023) ☆30 · Updated last year
- ☆33 · Updated last year
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs ☆38 · Updated last year
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023) ☆17 · Updated last year
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq ☆29 · Updated 2 years ago
- Code for "Teaching LM to Translate with Comparison" ☆39 · Updated last year
- ☆81 · Updated 8 months ago
- An easy-to-use kNN-MT toolkit ☆104 · Updated 2 years ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆88 · Updated 9 months ago
- ☆145 · Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆132 · Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors ☆37 · Updated 6 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback ☆40 · Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 · Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆56 · Updated last year
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023). ☆14 · Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆83 · Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆21 · Updated last year
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT ☆89 · Updated last year
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks ☆99 · Updated 2 years ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆91 · Updated 6 months ago
- [ACL 2023] kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation ☆17 · Updated 2 years ago