HICAI-ZJU / SciKnowEvalView external linksLinks
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
☆26Jul 13, 2025Updated 7 months ago
Alternatives and similar repositories for SciKnowEval
Users that are interested in SciKnowEval are comparing it to the libraries listed below
Sorting:
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆12Oct 12, 2024Updated last year
- SciAssess is a comprehensive benchmark for evaluating Large Language Models' proficiency in scientific literature analysis across various…☆83May 21, 2025Updated 8 months ago
- Jupyter Notebooks for testing the impact of tip incentives for ChatGPT☆22Feb 23, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- R package to perform mechanistic & metabolic constrained species range simulations☆12Jul 9, 2025Updated 7 months ago
- A comprehensive command-line based pipeline for the analysis of direct injection FT-ICR mass spectrometry data☆12Feb 3, 2026Updated last week
- Computes predicted risk for atherosclerotic cardiovascular disease.☆11Nov 11, 2025Updated 3 months ago
- An R package to produce full spectrum flow cytometry plots outside the acquisition software.☆11May 9, 2023Updated 2 years ago
- Command-line tools to support meta-analysis using a library managed in Zotero☆11Feb 9, 2023Updated 3 years ago
- Data and program code for meta-analyses of population health and health services research questions☆20Feb 20, 2014Updated 11 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated 11 months ago
- Admiral Package Extension for Pediatric Clinical Trials☆13Jan 26, 2026Updated 2 weeks ago
- ☆12Jan 11, 2026Updated last month
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- NeRF implementation with minimal code and maximal readability using PyTorch☆11Aug 27, 2022Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- An R package for visualizing confounder control in meta-analyses☆11Jan 17, 2023Updated 3 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- Quickly scaffold powerful web components with Yeoman☆13Feb 26, 2018Updated 7 years ago
- An open source code of the GitHub Copilot Workspace☆12Jun 8, 2024Updated last year
- R Interface to the Metabolights REST API☆11Aug 19, 2025Updated 5 months ago
- 🌎🌐Automatic pull requests globally? (big) Getting done everything repetitive that humans don't like and beyond typo clean-ups. - You mi…☆10Feb 8, 2026Updated last week
- ☆11Nov 5, 2024Updated last year
- ☆11Oct 15, 2022Updated 3 years ago
- Repository containing dataset, models and code associated with the CHIME project☆17Aug 22, 2024Updated last year
- Browser extension for accessing open data resources in the biomedical domain☆11Jan 8, 2023Updated 3 years ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- Package for pooling the results from (dependent) tests☆12Dec 1, 2025Updated 2 months ago
- ☆12Nov 5, 2024Updated last year
- A basic science lab framework aimed at reproducibility and lab management.☆13Dec 3, 2025Updated 2 months ago
- R interface for Google Pub/Sub☆10Mar 3, 2023Updated 2 years ago
- CRAN Task View: Meta-Analysis☆14Dec 23, 2025Updated last month
- Matriz de distâncias rodoviárias entre os municípios brasileiros☆10Jun 3, 2024Updated last year
- Collection of R Packages that support analysis for the purposes of Health Technology Assessment (HTA)☆11Oct 22, 2021Updated 4 years ago
- code for paper Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering☆13Aug 13, 2024Updated last year
- An interactive web-based tool for analyzing, interrogating, and visualizing network meta-analyses using R-shiny☆15Updated this week
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated last year
- This repository contains a reaction condition selector.☆14Mar 19, 2025Updated 10 months ago