x66ccff / liveideabenchLinks
π€π‘ LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
β21Updated 2 months ago
Alternatives and similar repositories for liveideabench
Users that are interested in liveideabench are comparing it to the libraries listed below
Sorting:
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"β143Updated last year
- LLM for Scientific Research Surveyβ123Updated last year
- Code/data for MARG (multi-agent review generation)β59Updated 4 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discoveryβ124Updated 5 months ago
- Tree-of-Debate converts scientific papers into LLM personas that debate their respective novelties. To emphasize structured, critical reaβ¦β18Updated 6 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generationβ59Updated 4 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590β80Updated 6 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award β¦β42Updated last year
- β64Updated 9 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"β60Updated last year
- This repository contains ScholarQABench data and evaluation pipeline.β94Updated 5 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoningβ51Updated last year
- β72Updated 8 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Dataβ45Updated 11 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"β86Updated 3 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studiesβ177Updated 3 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examplesβ120Updated last week
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challengesβ27Updated 8 months ago
- A curated list of papers on LLMs and agents for scientific research and developmentβ85Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learningβ71Updated 8 months ago
- β22Updated last year
- official implementation of paper "Process Reward Model with Q-value Rankings"β65Updated last year
- A trainable user simulatorβ34Updated 7 months ago
- A collection of resources and papers on AI Scientist / Robot Scientistβ124Updated 4 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)β37Updated last year
- Official Implementation of the Baby-AIGS systemβ24Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology Viewβ120Updated 8 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agentβ69Updated 8 months ago
- [ICLR 2025] This is the code repo for our ICLRβ25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewβ¦β50Updated last year
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasonersβ26Updated last year