Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)
☆40May 16, 2025Updated 9 months ago
Alternatives and similar repositories for m-rewardbench
Users that are interested in m-rewardbench are comparing it to the libraries listed below
Sorting:
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"☆18Oct 26, 2024Updated last year
- ☆12Dec 14, 2023Updated 2 years ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated 9 months ago
- ☆18Nov 25, 2022Updated 3 years ago
- The evaluation code for MultiIF multi-turn and multi-lingual instruction following☆60Oct 29, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆132Aug 21, 2024Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Feb 24, 2026Updated last week
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 9 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆95Aug 15, 2023Updated 2 years ago
- ☆31Oct 15, 2021Updated 4 years ago
- An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset☆28Jan 19, 2025Updated last year
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- ☆33Aug 30, 2023Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Aug 24, 2025Updated 6 months ago
- DPO, but faster 🚀☆48Dec 6, 2024Updated last year
- ☆46Sep 29, 2025Updated 5 months ago
- ☆10Jan 4, 2023Updated 3 years ago
- GPU Optimization for Python☆10Mar 13, 2021Updated 4 years ago
- A flag generation AI created using DeepAIs API☆11Feb 8, 2022Updated 4 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- Cardiovascular Disease Classification Employing Empirical Mode Decomposition (EMD) of Modified ECG☆12Oct 6, 2023Updated 2 years ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆41Mar 7, 2025Updated 11 months ago
- code for Teaching LM to Translate with Comparison☆39Dec 15, 2023Updated 2 years ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆184May 20, 2025Updated 9 months ago
- ☆10Feb 12, 2024Updated 2 years ago
- ☆10Jan 20, 2024Updated 2 years ago
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Updated this week
- ☆12Jan 17, 2025Updated last year
- ☆10Jun 5, 2025Updated 9 months ago
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Visi…☆21Sep 2, 2025Updated 6 months ago
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 6 years ago
- Implementation of Generalized Cylinder Decomposition☆10Feb 9, 2018Updated 8 years ago