OSU-NLP-Group / awesome-agents4science

A curated list of papers on LLMs and agents for scientific research and development

☆69

Alternatives and similar repositories for awesome-agents4science

Users interested in awesome-agents4science are comparing it to the repositories listed below.

du-nlp-lab / LLM4SR
LLM for Scientific Research Survey
☆98 Updated 6 months ago
OSU-NLP-Group / ScienceAgentBench
[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
☆94 Updated 2 months ago
mandyyyyii / scibench
☆125 Updated last year
openags / Awesome-AI-Scientist-Papers
A collection of resources and papers on AI Scientist / Robot Scientist
☆86 Updated 2 months ago
ozyyshr / StructChem
Structured Chemistry Reasoning with Large Language Models
☆40 Updated last year
BunsenFeng / Knowledge_Card
Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.
☆21 Updated last year
THUDM / SciGLM
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)
☆79 Updated last year
ChicagoHAI / hypothesis-generation
This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…
☆77 Updated this week
dongxiangjue / Awesome-LLM-Self-Improvement
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …
☆88 Updated 7 months ago
lemon-prog123 / LongRePS
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
☆16 Updated 4 months ago
Ahren09 / AgentReview
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
☆79 Updated 8 months ago
QizhiPei / MathFusion
MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)
☆27 Updated 3 weeks ago
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆27 Updated 9 months ago
OS-Copilot / ScienceBoard
Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
☆100 Updated last month
xxxiaol / QRData
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
☆42 Updated 5 months ago
zjunlp / KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
☆151 Updated 5 months ago
sail-sg / CPO
[NeurIPS 2024] The official implementation of the paper "Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs".
☆125 Updated 4 months ago
zhiyuanhubj / UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
☆100 Updated last year
EagleW / Scientific-Inspiration-Machines-Optimized-for-Novelty
Official implementation of the ACL 2024 paper "Scientific Inspiration Machines Optimized for Novelty"
☆83 Updated last year
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36 Updated last year
icip-cas / Verifier-Engineering
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
☆61 Updated 8 months ago
google / spiqa
Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]
☆61 Updated 6 months ago
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆26 Updated last year
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆129 Updated 10 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆103 Updated last week
weirayao / Retroformer
☆33 Updated last year
TIGER-AI-Lab / TheoremQA
The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)
☆32 Updated last year
TianHongZXY / RLVR-Decomposed
Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆83 Updated 3 weeks ago
Alsace08 / OOD-Math-Reasoning
[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆27 Updated last year
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆81 Updated last month