HeartBench is an evaluation benchmark for the psychological and social sciences, designed to go beyond traditional knowledge and reasoning assessments. It measures the anthropomorphic capabilities of large language models (LLMs) in human-computer interaction, covering dimensions such as personality, emotion, social skills, and ethic…
☆46Jan 7, 2026Updated 4 months ago
Alternatives and similar repositories for HeartBench
Users that are interested in HeartBench are comparing it to the libraries listed below.
- ☆28Aug 13, 2025Updated 8 months ago
- Short RL☆18Apr 16, 2026Updated 3 weeks ago
- ☆11Jun 21, 2025Updated 10 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 8 months ago
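- A ComfyUI image generation service based on the Model Context Protocol (MCP) that generates images by calling a local ComfyUI instance through its API, enabling free-form natural-language image generation☆23Nov 30, 2025Updated 5 months ago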
- ☆21Feb 15, 2024Updated 2 years ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆61Apr 23, 2026Updated last week
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆37Sep 18, 2025Updated 7 months ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆56Dec 23, 2025Updated 4 months ago
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…☆13Sep 13, 2024Updated last year
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach☆21Nov 17, 2025Updated 5 months ago
- ☆48Feb 26, 2025Updated last year
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆16Jan 9, 2026Updated 3 months ago
- ☆12Sep 23, 2022Updated 3 years ago
- Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning☆18Feb 21, 2025Updated last year
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated 2 years ago
- This repository provides an introduction to uncertain labels in chest X-ray diagnosis.☆10Oct 20, 2024Updated last year
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆23Aug 28, 2025Updated 8 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions☆11May 17, 2024Updated last year
- The code of paper "MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning" accep…☆10Mar 5, 2024Updated 2 years ago
- [MICCAI'25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment☆20Feb 27, 2026Updated 2 months ago
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning. Releases the dataset and the model weights.☆13May 26, 2025Updated 11 months ago
- ☆18Nov 11, 2022Updated 3 years ago
- ☆17Sep 23, 2024Updated last year
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 4 months ago
- Official implementation of the MICCAI 2023 paper "Knowledge Boosting: Rethinking Medical Contrastive Vision-Language Pre-training"☆16Mar 19, 2024Updated 2 years ago
- implementation of dualformer☆25Mar 1, 2025Updated last year
- ☆25Mar 4, 2025Updated last year
- ☆17Aug 5, 2025Updated 9 months ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆27Apr 9, 2026Updated 3 weeks ago
- RadGraph: Extracting Clinical Entities and Relations from Radiology Reports☆14Nov 22, 2022Updated 3 years ago
- Improving Medical Vision-Language Contrastive Pretraining with Semantics-aware Triage☆11Jun 25, 2023Updated 2 years ago
- [CVPR'25] Conformal prediction for vision-language models. Enhancing VLM deployment with reliability guarantees.☆21Jun 7, 2025Updated 10 months ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆29Mar 18, 2026Updated last month
- official implementation of "Med-Unic: unifying cross-lingual medical vision-language pre-training by diminishing bias"☆17Sep 22, 2023Updated 2 years ago
- ☆23Nov 27, 2025Updated 5 months ago
- Breast Cancer Detection using Mask-rcnn on the inbreast dataset☆13Dec 13, 2023Updated 2 years ago
- [ICCVW'23] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning☆18Oct 3, 2023Updated 2 years ago