[EMNLP 2025] A real-world clinical benchmark for medical LLMs with physician validation — 2,996 questions from EHRs
☆26Apr 15, 2026Updated 2 weeks ago
Alternatives and similar repositories for LLMEval-Med
Users that are interested in LLMEval-Med are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM evaluation on 2024 Chinese Gaokao Mathematics — zero-contamination benchmark with dual prompt formats☆19Apr 15, 2026Updated 2 weeks ago
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 8 months ago
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆17Jun 6, 2024Updated last year
- ☆139Apr 14, 2026Updated 3 weeks ago
- This is the 2024 OS lab repository.☆11Jun 27, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ESWC '24] This repo is official implementation for the paper "Towards Harnessing Large Language Models as Autonomous Agents for Semantic…☆10May 25, 2024Updated last year
- Extract corpora from Wikipedia dumps☆26Mar 26, 2019Updated 7 years ago
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆28Apr 23, 2026Updated last week
- The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"☆20Dec 24, 2024Updated last year
- The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…☆71Sep 29, 2025Updated 7 months ago
- 2024年北航os课程仓库,不同分支包含不同lab的代码,以及笔记、思考题作业等☆25Jul 13, 2024Updated last year
- DoctorRAG is a medical AI that mimics doctor-like reasoning by combining textbook knowledge with insights from similar patient cases, usi…☆21May 21, 2025Updated 11 months ago
- D.Com 학우들을 위한 커리어 조언 Repo☆12May 17, 2023Updated 2 years ago
- Repository for the research work "Ontology Generation using Large Language Models", presented at ESWC 2025.☆34Aug 15, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe☆192Updated this week
- ☆22Sep 1, 2025Updated 8 months ago
- Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics☆13Mar 6, 2025Updated last year
- About Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044☆28Nov 17, 2025Updated 5 months ago
- An open-source alternative to v0.dev. Cost-effective, highly customizable, and seamlessly integrated within GitHub.☆32Jan 24, 2024Updated 2 years ago
- 这是一个票据自动识别处理的仓库,希望对有类似业务需求的同学有借鉴意义☆39Apr 14, 2023Updated 3 years ago
- Using Seq2Seq transformers for Text2SQL task on WikiSQL dataset.☆12Jan 8, 2022Updated 4 years ago
- ☆17Dec 31, 2023Updated 2 years ago
- Papers and codes of Physics-informed Deep Compositional Operator Network☆13Oct 31, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 北航场馆预约系统Python+Selenium自动化脚本☆33Apr 27, 2022Updated 4 years ago
- Latent Knowledge-Guided Video Diffusion for Scientific Phenomena Generation from a Single Initial Frame☆17Updated this week
- MDRDC dataset and used baselines☆12Feb 20, 2023Updated 3 years ago
- We present cod-bench containing 12 operators and 10 datasets.☆11Jun 5, 2024Updated last year
- ☆13Feb 14, 2024Updated 2 years ago
- Code for "Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space"☆24Mar 25, 2026Updated last month
- ☆15Jul 18, 2025Updated 9 months ago
- This repository contains the code for the paper: Deciphering and integrating invariants for neural operator learning with various physica…☆13Mar 18, 2024Updated 2 years ago
- [ACL 2026] A large-scale longitudinal study on robust and fair evaluation of LLMs — 200K+ generative questions across 13 disciplines☆37Apr 13, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Aug 22, 2024Updated last year
- ☆15Mar 6, 2024Updated 2 years ago
- Basic setup and easy to follow templates to interact and search CogStack for data analysts☆12Sep 18, 2025Updated 7 months ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆151Apr 26, 2026Updated last week
- my own project☆37Jul 15, 2024Updated last year
- ☆20Aug 14, 2025Updated 8 months ago
- BUAA OS Lab "MOS" Open Source Repository | 北航操作系统课程 MOS 内核实验开源代码仓库☆58May 2, 2025Updated last year