Flexible evaluation tool for language models
☆59Jun 3, 2026Updated last week
Alternatives and similar repositories for flexeval
Users that are interested in flexeval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆17Jul 1, 2021Updated 4 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆126Apr 10, 2026Updated 2 months ago
- A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus☆10Jun 26, 2024Updated last year
- ☆30Apr 10, 2025Updated last year
- DefSent: Sentence Embeddings using Definition Sentences☆23Aug 5, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated 2 years ago
- Japanese instruction data (日本語指示データ)☆24Jul 13, 2023Updated 2 years ago
- ☆16Nov 19, 2023Updated 2 years ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆20Jun 17, 2025Updated 11 months ago
- ☆35Dec 17, 2020Updated 5 years ago
- 生成自動評価を行うためのPythonツール☆43Mar 18, 2026Updated 2 months ago
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆58Sep 22, 2024Updated last year
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆90Mar 16, 2026Updated 2 months ago