HICAI-ZJU / SciKnowEval
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
☆17 · Updated 5 months ago
Alternatives and similar repositories for SciKnowEval:
Users interested in SciKnowEval are comparing it to the repositories listed below.
- ☆11 · Updated last year
- Structured Chemistry Reasoning with Large Language Models ☆37 · Updated 11 months ago
- Pre-trained Language Model for Scientific Text ☆45 · Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆43 · Updated 4 months ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses> ☆36 · Updated last week
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion ☆19 · Updated last month
- A trainable user simulator ☆34 · Updated 7 months ago
- ☆13 · Updated 5 months ago
- Official Implementation of the Baby-AIGS system ☆23 · Updated 5 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆80 · Updated last year
- Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.or… ☆23 · Updated 2 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆19 · Updated 9 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆35 · Updated 2 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings) ☆111 · Updated 7 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆17 · Updated 5 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter". ☆12 · Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning ☆51 · Updated last year
- Exploring whether LLMs perform case-based or rule-based reasoning ☆28 · Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆47 · Updated 3 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆39 · Updated 5 months ago
- Evaluate the Quality of Critique ☆34 · Updated 10 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning? ☆25 · Updated last month
- ☆21 · Updated 2 weeks ago
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large … ☆23 · Updated 10 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024] ☆25 · Updated 5 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆49 · Updated 5 months ago
- ☆30 · Updated last year
- Code and data for the ACL2024 paper "InstructProtein: Aligning Human and Protein Language via Knowledge Instruction". ☆18 · Updated 7 months ago
- ☆119 · Updated 9 months ago
- ☆22 · Updated 9 months ago