Evaluation Pipeline for medical tasks.
☆12Feb 13, 2026Updated 3 weeks ago
Alternatives and similar repositories for med-eval
Users who are interested in med-eval are comparing it to the libraries listed below
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆11Oct 18, 2021Updated 4 years ago
- Creates a list of the latest LLMs☆20Feb 1, 2026Updated last month
- AASC: ACL Anthology Sentence Corpus☆20Oct 28, 2020Updated 5 years ago
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆56Sep 22, 2024Updated last year
- ☆30Jun 11, 2021Updated 4 years ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Oct 14, 2024Updated last year
- Problem A of the 2nd “Teddy Cup” Data Analysis Vocational Skills Competition☆10Sep 15, 2020Updated 5 years ago
- [2024 edition] Text classification with BERT☆30Jul 8, 2024Updated last year
- LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, fi…☆37May 2, 2024Updated last year
- Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 5 months ago
- ☆31Mar 24, 2023Updated 2 years ago
- Logical inference system based on event semantics and degree semantics in formal semantics☆11Jan 22, 2023Updated 3 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- A list of GitHub repositories you can contribute to by translating into Japanese☆36Dec 31, 2025Updated 2 months ago
- Winning solution of the Microsoft Research "First TextWorld Problems: A Reinforcement and Language Learning Challenge"☆12Jun 21, 2022Updated 3 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- CDbw Index For Cluster Validation☆10Mar 26, 2019Updated 6 years ago
- This repository contains codes for *Sem 2023 paper “Generative Data Augmentation for Aspect Sentiment Quad Prediction”.☆11May 30, 2023Updated 2 years ago
- A more Python-like version of the WaPEN syntax☆14Updated this week
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated last month
- ProxyExplainer for Graph Neural Networks☆15Oct 24, 2024Updated last year
- Some takeaways from the 8th “Teddy Cup” Data Mining Challenge☆10Nov 26, 2020Updated 5 years ago
- The main controller for services in the cs-insights project through docker-compose.☆13Aug 25, 2023Updated 2 years ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- ☆11Nov 8, 2023Updated 2 years ago
- Classification of human emotion using multi-modal models☆12Jun 27, 2020Updated 5 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!