A unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of large language and vision–language models across the full research workflow.
☆78Apr 3, 2026Updated last week
Alternatives and similar repositories for SciEvalKit
Users who are interested in SciEvalKit are comparing it to the libraries listed below.
- Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]☆64Aug 20, 2021Updated 4 years ago
- ☆14Jan 6, 2025Updated last year
- ☆12Jan 14, 2026Updated 2 months ago
- This is a joint project between Helmholtz Imaging (located at DKFZ) and Lin Yang and Otmar Schmid (Helmholtz Munich).☆13Nov 6, 2024Updated last year
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆48May 12, 2025Updated 10 months ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)☆15May 2, 2025Updated 11 months ago
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆15Nov 6, 2024Updated last year
- [ACL 2025] Multi-Agent System for Science of Science☆67Jul 26, 2025Updated 8 months ago
- MICCAI 2023: Morphology-inspired Unsupervised Gland Segmentation via Selective Semantic Grouping☆10Mar 5, 2026Updated last month
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆28Sep 1, 2025Updated 7 months ago
- ☆11Jan 8, 2025Updated last year
- Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization☆12Jul 15, 2023Updated 2 years ago
- Demos of mutation testing and fuzz testing prepared for the Software Testing Course of NJU Software Institute.☆12Dec 14, 2023Updated 2 years ago
- ☆10May 10, 2024Updated last year
- ☆14Jul 24, 2024Updated last year
- Sci. Rep. 2025 | Revisiting model scaling with a U-net benchmark for 3D medical image segmentation☆18Aug 21, 2025Updated 7 months ago
- A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)☆447Oct 3, 2025Updated 6 months ago
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Jul 1, 2025Updated 9 months ago
- A collection of state-of-the-art single image super resolution methods.☆13Apr 26, 2021Updated 4 years ago
- Official Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…☆38Sep 28, 2025Updated 6 months ago
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…☆24Apr 6, 2025Updated last year
- [MICCAI, 2022, Student Travel Award]: CFDA_Collaborative Feature Disentanglement and Augmentation for Pulmonary Airway Tree Modeling of C…☆13Dec 22, 2023Updated 2 years ago
- ☆21Dec 22, 2024Updated last year
- Voxel-based Network for Shape Completion by Leveraging Edge Generation (ICCV 2021, oral)☆17Dec 4, 2022Updated 3 years ago
- An mmcv logger hook that can push model results to WeChat☆21Oct 11, 2022Updated 3 years ago
- A microcontroller-based physical bot for the WeChat mini-game "Jump Jump" (跳一跳), designed with MATLAB and C51☆10Apr 17, 2018Updated 7 years ago
- ☆14Dec 9, 2023Updated 2 years ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 4 months ago
- CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale☆128Dec 6, 2025Updated 4 months ago
- ☆15Mar 11, 2023Updated 3 years ago
- A nnU-Netv2 based acceleration solution for Abdominal organs and tumor segmentation.☆20Nov 24, 2023Updated 2 years ago
- MC-CoT implementation code☆22Jun 24, 2025Updated 9 months ago
- [ACM MM 2023] Mask-Guided Progressive Network for Joint Raindrop and Rain Streak Removal in Videos☆18Jul 22, 2024Updated last year
- Official implementation of "3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation", ICLR 2024☆12Nov 5, 2024Updated last year
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆55Updated this week
- OrqueIO main source code repository☆24Mar 26, 2026Updated 2 weeks ago
- Generative turbulence model TurbDiff as proposed in "From Zero to Turbulence: Generative Modeling for 3D Flow Simulation", ICLR 2024☆34Dec 7, 2025Updated 4 months ago
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆86Jun 4, 2025Updated 10 months ago
- Code for [Pattern Recognition] Prompt Learning based Source-free Domain Adaptation for Medical Image Segmentation.☆30Apr 22, 2025Updated 11 months ago