Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities in alignment with human preferences.
☆15Dec 30, 2024Updated last year
Alternatives and similar repositories for metametrics
Users that are interested in metametrics are comparing it to the libraries listed below.
- R3: Robust Rubric-Agnostic Reward Models☆22Jul 12, 2025Updated 11 months ago
- WorldCuisines is an extensive multilingual and multicultural benchmark that spans 30 languages, covering a wide array of global cuisines.…☆27May 8, 2025Updated last year
- URIEL+ knowledge base for natural language processing☆17May 5, 2026Updated last month
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆38Apr 7, 2026Updated 2 months ago
- ☆143Apr 8, 2026Updated 2 months ago
- This project demonstrates function-calling with Python and Ollama, utilizing the Africa's Talking API to send airtime and messages to pho…☆19Jun 9, 2026Updated last week
- A library of translation-based text similarity measures☆25Dec 11, 2023Updated 2 years ago
- MonAPI: Democratizing API Monitoring Tools through Open Source☆28Jan 13, 2023Updated 3 years ago
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"☆20Oct 26, 2024Updated last year
- Multilingual entity linking with the BELA model☆12Jul 20, 2023Updated 2 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆12Feb 19, 2024Updated 2 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- ☆33Jun 20, 2018Updated 7 years ago
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated last month
- SemEval 2019 Task 4: Hyperpartisan News Detection☆10Nov 9, 2019Updated 6 years ago
- Siamese graph convolutional network for content based remote sensing image retrieval☆14Sep 13, 2021Updated 4 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 9 months ago
- [2018-Runner Up Prizes for Bookdown contest] MSc Dissertation on "Spatial Generalized Linear Mixed Models and Its Applications" (China U…☆16Jun 28, 2020Updated 5 years ago
- [ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”☆44May 8, 2026Updated last month
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Fast syllable estimation library based on pattern matching.☆41Mar 1, 2026Updated 3 months ago
- Fine-tuning open-source large and small instruct/chat language models (LLMs & SLMs) from the Hugging Face Model Hub using public datasets…☆19Aug 31, 2024Updated last year
- A powerful CLI application for automated AI-powered video generation. Create engaging short-form videos for TikTok, YouTube Shorts, and I…☆38Nov 28, 2025Updated 6 months ago
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 4 years ago
- Official repository of "Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion" (ACMMM 2024)☆16Oct 31, 2024Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆21Apr 9, 2025Updated last year
- ☆15Sep 30, 2023Updated 2 years ago
- MonAPI: Democratizing API Monitoring Tools through Open Source☆43Apr 23, 2024Updated 2 years ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 5 years ago
- This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO☆35Nov 26, 2025Updated 6 months ago
- Arabic Speech Recognition with Whisper: Fine-tune the Whisper model from OpenAI for Arabic speech recognition tasks. This repository prov…☆22Feb 28, 2024Updated 2 years ago
- ☆12Jun 30, 2025Updated 11 months ago
- Code for CVPR2018 "Iterative Learning with Open-set Noisy Labels"☆12Mar 12, 2021Updated 5 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- ☆12Nov 28, 2022Updated 3 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆13Jun 11, 2021Updated 5 years ago
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆87Jun 5, 2024Updated 2 years ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago