blcuicall / OMGEvalLinks
OMGEval๐ฎ: An Open Multilingual Generative Evaluation Benchmark for Foundation Models
โ35Updated last year
Alternatives and similar repositories for OMGEval
Users that are interested in OMGEval are comparing it to the libraries listed below
Sorting:
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Explorationโ36Updated last year
- โ28Updated 3 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseqโ29Updated 2 years ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Modelsโ118Updated 6 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"โ20Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)โ50Updated last year
- an easy-to-use knn-mt toolkitโ106Updated 2 years ago
- โ58Updated last year
- Collection of papers for scalable automated alignment.โ94Updated last year
- code for Teaching LM to Translate with Comparisonโ39Updated 2 years ago
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Templateโ22Updated 2 years ago
- ๐ฉบ A collection of ChatGPT evaluation reports on various bechmarks.โ50Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"โ136Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Budโ22Updated last year
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)โ17Updated last year
- โ89Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimationโ90Updated last year
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPTโ91Updated 2 months ago
- [ACL 2023] kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translationโ17Updated 2 years ago
- โ23Updated 3 years ago
- โ147Updated last year
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citationsโ13Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"โ59Updated 3 years ago
- The repository for paper <Evaluating Open-QA Evaluation>โ25Updated last year
- ๐ An unofficial implementation of Self-Alignment with Instruction Backtranslation.โ138Updated 8 months ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)โ31Updated 2 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedbackโ40Updated 2 years ago
- A Code System for Grammar Error Correction Method. Code Repo for ACL 24 Main "Detection-Correction Structure via General Language Model fโฆโ22Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningโ184Updated 6 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsโ58Updated last year