☆29Nov 14, 2025Updated 4 months ago
Alternatives and similar repositories for tower-eval
Users that are interested in tower-eval are comparing it to the libraries listed below.
- ☆13Aug 23, 2024Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- ☆11Jul 24, 2024Updated last year
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated last month
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆128Oct 13, 2025Updated 5 months ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2…☆20Jul 19, 2023Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- The Open Multilingual Wordnet☆72May 6, 2024Updated last year
- A library for minimum Bayes risk (MBR) decoding☆52Nov 2, 2025Updated 5 months ago
- We systematically studied the influencing factors when an LLM generates benchmarks. By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- Neural discourse structure for text categorization☆12Aug 27, 2017Updated 8 years ago
- personalized-llms with allen institute☆14Jun 22, 2023Updated 2 years ago
- Universal Semantic Annotator (LREC 2022)☆18Jan 29, 2025Updated last year
- Code for Fooling Contrastive Language-Image Pre-trained Models with CLIPMasterPrints☆15Jan 25, 2026Updated 2 months ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 2 years ago
- ☆36Mar 26, 2022Updated 4 years ago
- SocialDial: A Benchmark for Socially-Aware Dialogue Systems (SIGIR'23)☆16Aug 4, 2023Updated 2 years ago
- ☆138Jan 22, 2026Updated 2 months ago
- ☆10Aug 31, 2023Updated 2 years ago
- Code for paper "Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging"☆16May 31, 2019Updated 6 years ago
- Library for pruning experts per language pair in NLLB-200☆34Jul 7, 2023Updated 2 years ago
- ☆13Jul 13, 2018Updated 7 years ago
- State-of-the-art LLM-based translation models.☆582Apr 9, 2025Updated last year
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"☆14Oct 10, 2022Updated 3 years ago
- LossHub: Loss Functions Library for Image Classification and Detection☆14Oct 9, 2022Updated 3 years ago
- A Windows program to view/examine XLIFF file contents.☆13Sep 26, 2024Updated last year
- TAUS Dynamic Quality Framework API☆11Sep 17, 2020Updated 5 years ago
- Torchreid-Pip: Packaged version of Torchreid☆13Oct 16, 2022Updated 3 years ago
- A Chinese sentiment analysis library in Python☆15Dec 17, 2021Updated 4 years ago
- Train Gradient Boosting models that are both high-performance *and* Fair!☆106Mar 11, 2026Updated last month
- ☆20Mar 12, 2025Updated last year
- A large Chinese sentiment lexicon consisting of 8000 words☆24Oct 31, 2012Updated 13 years ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15May 13, 2025Updated 10 months ago
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year
- ☆15Dec 26, 2024Updated last year
- OcSort-Pip: Packaged version of the OcSort repository☆17Jan 6, 2023Updated 3 years ago