OSU-NLP-Group / awesome-agents4science
A curated list of papers on LLMs and agents for scientific research and development
☆54Updated 5 months ago
Alternatives and similar repositories for awesome-agents4science
Users that are interested in awesome-agents4science are comparing it to the libraries listed below
Sorting:
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆85Updated 2 weeks ago
- LLM for Scientific Research Survey☆85Updated 3 months ago
- ☆120Updated 10 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated 2 months ago
- Evaluate the Quality of Critique☆35Updated 11 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆79Updated last year
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion☆19Updated last month
- Structured Chemistry Reasoning with Large Language Models☆37Updated last year
- Official Implementation of the Baby-AIGS system☆23Updated 5 months ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆41Updated 2 weeks ago
- Papers about scientific hypothesis generation with large language models (LLMs).☆65Updated 2 months ago
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆57Updated 5 months ago
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆66Updated last week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆116Updated 7 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆31Updated last year
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆75Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆59Updated 6 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆80Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆44Updated 5 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 8 months ago
- Discovering Data-driven Hypotheses in the Wild☆80Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆40Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆119Updated last month
- official implementation of paper "Process Reward Model with Q-value Rankings"☆57Updated 3 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆13Updated 6 months ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆12Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆57Updated 5 months ago