HICAI-ZJU / SciKnowEval
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
☆17Updated 6 months ago
Alternatives and similar repositories for SciKnowEval
Users that are interested in SciKnowEval are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- Structured Chemistry Reasoning with Large Language Models☆38Updated last year
- Pre-trained Language Model for Scientific Text☆45Updated last year
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion☆19Updated this week
- ☆50Updated 2 months ago
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆44Updated 5 months ago
- A trainable user simulator☆34Updated 8 months ago
- A curated list of papers on LLMs and agents for scientific research and development☆54Updated 5 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated 3 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆21Updated 10 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆40Updated 6 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆25Updated 6 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆59Updated 6 months ago
- Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality …☆85Updated 6 months ago
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24Updated 11 months ago
- Official Implementation of the Baby-AIGS system☆23Updated 5 months ago
- ☆44Updated 7 months ago
- ☆10Updated last month
- Preparing for ML Interviews.☆11Updated 3 weeks ago
- Code and data for the ACL2024 paper "InstructProtein: Aligning Human and Protein Language via Knowledge Instruction".☆18Updated 8 months ago
- ☆120Updated 10 months ago
- Retrieved Sequence Augmentation for Protein Representation Learning☆51Updated last year
- ☆14Updated 2 months ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆41Updated 2 weeks ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆10Updated 7 months ago
- Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:…☆24Updated 2 weeks ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆62Updated 6 months ago
- [AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research☆28Updated 9 months ago
- Must-read papers on NLP for science.☆57Updated last year