HICAI-ZJU / SciKnowEval
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
☆15Updated 5 months ago
Alternatives and similar repositories for SciKnowEval:
Users that are interested in SciKnowEval are comparing it to the libraries listed below
- ☆11Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆40Updated 3 months ago
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion☆16Updated last week
- Structured Chemistry Reasoning with Large Language Models☆35Updated 10 months ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆33Updated this week
- Pre-trained Language Model for Scientific Text☆44Updated last year
- A trainable user simulator☆34Updated 6 months ago
- What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks☆139Updated 8 months ago
- Retrieved Sequence Augmentation for Protein Representation Learning☆50Updated last year
- ☆62Updated this week
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆108Updated 6 months ago
- Code and data for the ACL2024 paper "InstructProtein: Aligning Human and Protein Language via Knowledge Instruction".☆17Updated 7 months ago
- ☆14Updated 5 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆78Updated last year
- Official Implementation of the Baby-AIGS system☆23Updated 4 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆11Updated 4 months ago
- LLM for Scientific Research Survey☆78Updated 2 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆30Updated 3 months ago
- ☆38Updated 5 months ago
- [AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research☆29Updated 7 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆39Updated 5 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆25Updated 4 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆18Updated 8 months ago
- Source code for "A Deep-learning System Bridging Molecule Structure and Biomedical Text with Comprehension Comparable to Human Profession…☆86Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated last month
- ☆116Updated 8 months ago
- Awesome Long-CoT Data☆13Updated last week
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Updated last year
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆14Updated 3 weeks ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆14Updated 2 weeks ago